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«•  INTRODUCTION 

MARC  I E  --  Meaning  Analysis,  Response  Generation,  and  Inference 
m  English  --  is  a  model  of  natural  language  processing  incorporated 
m  a  computer  program  now  running  at  the  Artificial  Intelligence 
Laboratory  at  Stanford  University.  The  program  contains  the  core 
processes  language  analysis,  memory  model,  and  language  generation 
—  necessary  for  several  natural  language  tasks.  Its  operation  in 
one  task  domain,  sentence  paraphrasing,  is  the  topic  of  this  report. 

The  task  of  paraphrasing  English  sentences  can  be  stated  simply 
as  f o l I ous: 

Given  an  English  sentence,  produce  other  sentences  which 

English  speakers  interpret  as  having  the  same  meaning. 

Of  course  the  notion  of  ‘meaning’  is  a  very  vague  one  and  the 
only  test  available  for  the  acceptability  of  a  proposed  paraphrase  of 
a  sentence  is  to  ask  native  speakers  whether  the  two  sentences  ‘mean’ 
the  same  thing.  Fortunately  speakers  seem  to  agree  on  the  meaning  of 
an  isolated  sentence,  at  least  to  a  considerable  amount  of  detail; 
the  question  of  whether  the  sentences  we  generate  are  paraphrases 
according  to  some  more  formal  definition  of  ‘meaning'  and 
‘paraphrase’  wi  i  I  not  concern  us  here. 


The  following  examples.uhich  the  program  produce?,  should  give 


the  reader  a  better  feeling  for  this  notion  of  sentence  paraphrasing: 


Source: 

Par aphrsse: 

Source: 

Paraphrase: 

Source: 

Paraphrase: 

Source: 

Paraphrase: 


JOHN  CAVE  MARY  A  BICYCLE. 

MARY  RECEIVED  A  BICYCLE  FROM  JOHN. 

JOHN  ADVISED  MARY  TO  DRINK  THE  UINE, 

JOHN  TOLD  MARY  SHE  WOULD  LIKE  TO  DRINK  THE  WINE. 
MARY  UANTS  TO  CHOKE  FRED. 

MARY  BELIEVES  SHE  UOULD  ENJOY  PREVENTING  FREO  FROM 
BREATHING  BY  GRABBING  Hi'S  NECK. 

JOHN  PREVENTED  MARY  FROM  GIVING  BILL  THE  BOOK  BY 
GIVING  THE  BOOK  TO  FRED. 

DILL  MAS  UNABLE  TO  GET  THE  BOOK  FROM  MARY  BECAUSE 
JOHN  GAVE  FRED  THE  BOOK. 


Source: 

Par  aptv-ase: 
Paraphrase: 

Source: 

Paraphrase: 

Source: 

Paraphrase: 


JOHN  KILLED  MARY  BY  CHOKING  HER. 

JOHN  STRANGLED  MARY. 

JOHN  CHOKED  MARY  AND  SHF  DIED  BECAUSE  SHE  LIAS  UNABLE 
TO  BREATHE, 

JOHN  TOLD  MARY  HE  UOULD  HIT  HER  UITH  HIS  FOOT. 

JOHN  THREATENED  TO  KICK  MARY. 

JOHN  LOANEO  A  BICYCLE  TO  MARY. 

JOHN  GAVE  MARY  A  BICYCLE  AND  HE  EXPECTS  SHE  MILL 
RETUHN  IT  TO  HIM. 


Each  of  these  examples  is  handed  by  the  current  program,  the 
‘Source’  coming  from  a  human  user,  the  ‘Paraphrase’  being  produced  by 
the  program.  The  program  does  not  handle  pronouns  in  the  input;  the 


fifth  example  would  be  typed  in  as  "JOHN  KILLED  MARY  BY  CHOKING 
MARY",  There  are  also  some  minor  distinctions  in  form  between  the 


output  produced  bu  MARGIE  and  the  paraphrase*  in  these  e.a.ples, 
ai though  in  all  case  tha  sentences  produced  are  close  to  those  shoun 
and  use  the  sane  uords.  A  co.plete  listing  of  the  programs  actual 

Per formance  on  several  exa.pl.,  is  provided  at  the  end  of  this 
repor  t , 

Ths  ability  to  iphrase  single  sentences  is  not  itself 
particularly  interesting,  pr i mar i I y  because  of  its  artificiality. 
Cerf^nly  humans  have  the  ability  to  create  sentence  paraphrases,  as 
has  been  demonstrated  in  psycho! inguistic  research  f4) .  It  is  an 
ability,  however ,  which  is  seldom  used  outside  of  experimental 
contexts.  (Trying  to  explain  the  meaning  of  a  complicated  sentence 
to  a  non-native  speaker  or  a  child  is  one  natural  use  of  this 
ability.)  Furthermore,  none  o*  the  commonly  proposed  computational 
tasks  which  deal  with  natural  language  processing  directly  involve 
sentence  paraphrasing. 

Nevertheless  sentence  paraphrasing,  at  least  as  defined  and 
accomplished  by  the  mode!  described  in  this  report,  is  an  interesting 
research  task  for  sever r I  reasons: 

AI  MARGIE'S  Mthod  of  per  for  filing  this  (ash  can  vieued  „ 

l^PtUmALJAWlURrifUIiffl.  This  is  because  ,he  paraphrases 
are  not  produced  bg  directly  converting  patte-ns  in  the  source 
sentence  into  patterns  in  the  paraphrase  sentence.  Rather,  the 
inputs  are  first  converted  into  a  language- tree  (conceptual! 
-epresentation  of  their  meaning.  This  representation  alone  is  then 
used  to  produce  paraphrase  sentenced).  Neither  the  Herds  nor  the 
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syntax  of  the  Input  are  considered  in  their  generation.  The  same 
language-free  representation  produced  by  the  analyzer  for  the 
paraphrase  task  could  be  used  to  generate  German  or  French 
realizations  as  well  as  English.  To  perform  translation  from  English 
to  a  second  language  it  would  not  be  necessary  to  alter  the  input- 
output  behavior  of  the  analysis  algorithm.  The  generation  algorithm 
would  need  to  be  provided  with  ali  the  necessary  linguistic  data  for 
the  target  language.  Analogously,  the  generation  algorithm  used  for 
English  sentence  paraphrasing  could  be  used  in  conjunction  with  a 
German  analysis  routine  to  perform  German-Eng i i sh  translation. 
Although  neither  the  analysis,  generation,  or  i„eraory  model  of  the 
current  program  is  powerful  enough  to  be  used  yet  for  the  translation 
of  interesting  text,  this  .s  not  due  tn  a  theoretical  difference  in 
the  mechanisms  involved  in  translation  and  paraphrase.  flANGic  is 
designed  to  handle  both  these  tasks  and  others  (question  answering, 
conversation)  in  three  stages: 

•  -  Produce  a  conceptual  representation  of  the  meaning  of  the  input 
!!:  dec,de  on  the  ’conceptual’  content  of  the  response. 

1  M*  produce  a  language  response  which  expresses  this  meaning. 

The  second  of  these  stages  is  pro&aoly  the  least  understood  of  these 
processes.  Paraphrase  and  translation  are  closely  related  precisely 
because  the  same  simple  algorithm  can  be  used  for  this  a+ep;  namely, 
the  representation  produced  by  (i)  can  itself  be  used  as  the 
conceptual  content  passed  to  (iii). 

B)  aphrases  can  demonstrate  UNQERSTANQiNG.  it  is  possible  to 
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obtain  many  paraphrases  by  syntactic  manipulations.  A  sentence  with 
a  subject  anri  a  direct  object  can  be  put  in  active  or  passive  voice, 
yielding  such  pai  aphrases  as 

"John  threw  the  ball  to  Mary" 

The  ball  was  thrown  to  Mary  by  John" 

Such  paraphrases  could  be  produced  without  the  conceptual  analysis 
performed  by  MARGIE.  A  computer  implementation  of  a  transformational 
grammar  could  certainly  do  this.  But  no  one  would  claim  that  such 
‘syntactic*  paraphrases  demonstrate  understanding. 

More  interesting  paraphrases  result  from  situations  in  uhich 
two  words  may  be  used  interchangeably  but  require  a  change  in  the 
syntax  of  the  sentence: 

"The  university  owns  the  land" 

"The  land  belongs  to  the  university" 

It  might  appear  that  these  transformations  could  be  handled  by  ‘word 
sensitive’  transformational  rules.  But  they  actual ly  require  an 
analysis  which  finds  'semantic  senses’  of  words,  as  is  demonstrated 
by  the  paraphrase  relation: 

”1  sold  the  Chevy  to  Fred” 

M 

"Fred  purchased  the  Chevy  from  me" 

but  lack  of  paraphrase  relation  between 

"I  sold  ir.y  idea  to  the  management"  and 
"The  management  purchased  my  idea  from  ne" 

Such  paraphrases  thus  require  semantic  disambiguation  of  words,  a 

problem  which,  in  much  generality,  is  still  beyond  the  capabilities 
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of  current  language  processing  programs.  Since  it  is  generally 
recognized  that  the  solution  to  this  problem  requires  some  sort  of 
understanding  by  the  program,  a  system  which  produces  these 
paraphrases  in  the  appropriate  contexts  demonstrates  9ome  sort  of 
understanding. 

The  need  for  disambiguation  in  paraphrasing  can  be  seen  even 
more  clearly  in  the  sort  of  paraphrase  which  breaks  a  word  down  into 
its  ‘components’.  Ue  might  paraphrase 

"Jerry  dropped  '.he  lanp"  with 

"Jerry  let  go  of  the  lamp  which  allowed  the  lamp  to  fall" 
but  we  would  not  want 

"Jerry  dropped  five  dollars  at  the  race  track" 
paraphrased  analogously,  at  least  for  the  primary  reading  of  this 
sentence. 

Even  supposing  the  disambiguation  problem  were  solved  (or 
eliminated,  by  suitably  restricting  vocabulary  and  context),  these 
component  based  paraphrases  introduce  a  new  problem.  The  same  sort 
of  mechanism  which  handled  semantic  synonymy  might  also  handle  the 
paraphrase: 

Source:  "fly  friend  advised  me  to  visit  Spain" 

Paraphrase:  "Hy  friend  told  me  I  would  enjoy  visiting  Spain" 

Suppose  however  it  was  desired  to  produce  the  above  Source  given  tns 
Paraphrase,  Instead  of  simply  recognizing  the  pattern  ‘advise’  and 
appsyir.g  a  transformat ;on.  it  is  required  that  the  pattern  *te(l  X 
(that)  X  would  enjoy  .  .  ."be  found  in  the  analysis  of  the  input. 


S 


To  do  this  efficiently  requires  increased  soph istlcat ion  in  a  pattern 
matcher.  Furthermore,  i,  ue  uish  to  get  •advise’  as  a  paraphrase  of 
'tell  K  (that)  X  mould  like  an0  •suggest  to  X  (that)  .  .  . 

mould  please  X’  it  is  apparent  that  matching  syntactic  patterns  ot 
“  rd  senses  mould  rapidly  run  into  problems  from  the  quantity  of 

patterns  needed.  This  problem  is  avoided  in  MARGIE'S  method  of 
paraphrasing. 

MARGIE  has  no  rules  uhich  specify  explicit  paraphrase 
relation,  betneen  patterns  of  uord  senses.  Given  that  MARGIE's 
paraphrase,  are  produced  Iron:  a  language  free  represents! ion,  of 
course,  no  such  patterns  even  exist.  MARGIE  searches  instead  for 
conceptual  patterns.  These  are  dependent  on  the  meaning  „f  ,he 
source  sentence,  but  no,  on  the  oarticular  uords  or  syntax  used.  The 
patterns  sough,  are  no  more  complex  than  those  uhich  could  be  needed 
for  the  component  based  paraphrases  above,  and  the  number  of  patterns 
uhich  must  be  discriminated  is  much  smaller. 

Finally,  there  is  a  form  of  naraphrese  uhich  is  not  even 
theoretically  obtainable  through  uord  or  uord  sense  pattern  matching, 

ant,  uhich  demonstrates  even  mote  clearly  a  sort  of  understanding. 

For  instance, 

Paraphrase-  !°ld  Fred  he  ,JOulcJ  bomb  his  office" 

araphrase.  uahn  threatened  tc  bomb  Freci's  office" 

cannot  realistically  be  produced  by  finding  a  pattern  involving 
•tell’  and  -bomb’,  since  there  are  an  i,  .nite  number  of  things  uhich 
John  could  tell  Fred  that  uould  constitute  a  threat.  Although  MARGIE 
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cannot  perform  all  the  functions  necessary  to  produce  such 
paraphrases,  it  does  have  tne  required  I innuistic  mechanisms.  This 
point  will  be  discussed  further  in  section  IV, 

In  the  absence  o  any  dear  notion  of  what  ‘understanding’  is, 
it  is  pointless  to  clc,  m  that  the  production  of  a  given  paraphrase 
demonstrates  a  capar.tij  for  understanding.  Ue  will  describe  the 
processes  by  which  MARGIE  obtains  such  paraphrases  and  leave  it  for 
the  reader  to  consider  uhetber  this  meets  his  standards  for 
classification  as  ‘understanding', 

C)  MARGIE’S  paraphrase  production  exhibits  a  use  of  CONTEXT  in 

language  processing.  One  of  the  most  common  ci  t tic  isms  of  natural 
language  research  is  the  tendency  to  deal  with  example  sentenres 
outside  of  ary  nontext,  whereas  human  language  processing  always 
occurs  in  complex  social  and  linguistic  contexts  which  affect  both 
analysis  and  generation. 

The  paraphrasing  of  sirgle  sentences  seems  to  shirs  this  fault, 
but  the  model  described  here  performs  all  analysis  and  generation  in 
the  context  of  a  memory  model,  comprising  facts,  beliefs,  and  ruies 
which  arc  actively  used  during  the  paraphrase  process.  Fur tr.ermore, 
tne  information  contain  m  the  natural  language  sentences  being 
analyzed  can  be  added  to  this  memory  mode!  and  affect  the  production 
of  paraphrases  of  later  sentences.  Although  MARGIE  does  not  use 
linguistic  context  (the  particular  words  or  syntactic  forms  present 
in  the  input  sentence)  in  its  generate  process,  a  limitation  not 
shared  by  humans,  it  does  use  the  non-  nguistic  context  -esent  in 


the  memory  model  !ected  by  the  ’conceptual  content’  of  the 

linguistic  context.  For  instance,  if  MARGIE  has  been  told 

"Bill  had  th?  hook11,  and 
"Mary  has  the  bock", 
and  is  then  asked  to  paraphrase 

"Mary  uili  give  the  book  to  Bill" 
i  t  can  pi  odure 

"Mary  ui  I  I  return  the  bock  to  Bill” 

Since  an  understanding  of  the  language  analyst  and  generation 
processes  described  in  this  report  requires  an  understandina  of  the 
nature  of  conceptual  representation  and,  to  a  lesser  degree,  a 
knowledge  of  the  particular  representations  used  by  MARGIE,  the  next 
section  will  be  devoted  to  representational  matters.  It  is  suggested 
that  readers  already  familiar  with  Schank’s  163  work  on  Conc'phiai 
Dependency  skip  Section  II  and  refer  back  to  it  for  explanations  of 
unfamiliar  terms  or  notations 
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CONCEPTUAL  REPRESENTATION 


Many  forms  of  representation  of  language  content  have  been 
proposed  by  computa t i ona I  and  theoretical  linguists.  Some  are 
‘syntax*,  or  form,  based;  others  ‘semantics',  or  meaning,  based, 
‘Conceptual’  representations  may  be  distinguished  from  others  in 
several  ways: 

(A)  A  conceptual  representation  is  ‘language-tree’  —  that  is,  the 
same  set  of  units  and  relations  are  used  to  describe  meanings 
which  may  be  encoded  in  any  human  language. 

(B)  The  representations  provided  for  natural  language  sentences 

which  are  ‘similar’  in  meaning  should  directly  exhibit  this 
‘similarity*.  Closeness  of  meaning  need  not  be  formally 
defined:  it  is  simply  the  feeling  of  speakers  of  English,  for 

instance,  that  ‘running’  and  ’walking’  are  closer  in  meaning 
than  ‘running’  and  'killing'. 

(C)  The  representations  are  oriented  toward  use  in  a  computational 
memory  mode1  and  inference  system.  One  ramification  of  this  is 
that  the  units  and  relations  used  to  represent  meanings  derived 
from  language  must  be  the  same  ones  used  for  internally 
generated  information. 

(D)  The  representations  are  proposed  as  psychological  models  of 
human  cognitive  structures. 

CONCEPTUAL  DEPENDENCY'  (C. D. )  is  a  conceptual  representation 
which  encompasses  a  particular  set  of  primitive  conceptual  units  and 
relations.  I4  has  been  developed  and  described  by  Schank  16,71.  It 
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is  not  the  purpose  of  this  report  to  give  arguments  favoring 
conceptual  systems  in  general  or  C.D.  in  particular.  Those 
interested  in  such  matters  nil  I  find  such  material  in  the  above 
references.  The  rest  of  this  section  is  devoted  to  a  brief  survey  of 
tho3e  aspects  of  C.D.  pertinent  to  the  remainder  of  the  report. 

(1)  EVENTS 

A^TS  and  ACTORS 

!n  C.D.  all  actions  described  in  language  are  broken  down  into 
a  set  of  primitive  ACTs.  ACTs  are  performed  by  ACTORs,  and  this 

relationship  is  symbolized: 

<ACT0R>  <«==>  <ACT> 

'Eating'  is  represented  by  the  primitive  ACT  '*!NGEST*';  ‘John 
eats'  is  represented  as: 

♦JOHN*  <===>  $  INGE  ST* 

Not  all  ACTOR-ACT  relationships  describe  physical  events; 
'giving'  is  an  abstract  notion  involving  change  of  possession  and  is 
represented  by  the  AC*  '*ATRAN_*'.  For  'John  gives’  we  have  the 
representat i on: 

♦JOHN*  <*»»>  *ATRANS* 

C.D.  CASES 

The  concepts  of  'eating'  and  'giving'  involve  more  than  just 
ACTORS  and  ACTs.  One  must  eat  or  give  some  physical  object.  An 
object  cannot  just  be  given  bu  an  ACTCR;  there  must  also  be  some 
recipient  of  the  giving.  To  represent  relationships  between  ACTs  and 
entities  other  than  ACTORs,  C.D.  provides  a  set  of  conceptual  CASEs. 
Each  ACT  requires  the  presence  of  a  particular  subse*  of  CASEs, 
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Most  ACTs  require  an  OBJECTIVE  case  symbolized: 

0 

i -  <GBJECT> 

Examples  of  this  relationship  include: 

0 

"John  drinks  milk"  *J0HN*  <»■■>  * INGEST*  * -  *i1ILK* 

0 

"Ered  breathes"  *FRED*  <=>=*>  *!NGEST*  < -  *A!R* 


(the  latter  example  demonstrates  hou  required  conceptual  cases 
will  be  present  in  representat ions  even  if  no  con  esponding  surface 
case  exists) 

When  the  'possession-ship*  of  an  object  is  changed  by  an 
action,  there  must  be  both  a  00W0R  and  a  RECIPIENT  of  the  possession 
The  RECIPIENT  CASE  is  provided  to  represent  this  relationship,  and  i 
denoted 

R  I - >  <REC I P I ENT  > 

- - 1 

I— — <  <  DONOR  > 

The  ACT  *A TRAMS*  requires  the  RECIPIENT  CASE.  Some  examples: 

"John  (Jives  Nary  a  book",  or 
Mary  receives  a  beos  from  John" 

0  R  | - >  *J1ARY* 

♦JOHN*  <«■«<=>  *A7RANS*  - - *300K*  « - 1 

i - <  *J0HN* 


"John  takes  the  book  from  Mary" 

0  R  | - >  *JQHN* 

♦JOHN*  <■««>  *ATRANS*  « - *B0QK*  * - 1 

| - <  *f1ARY* 


The  ACT  *PTRANS*  is  used  to  represent  acvions  of  changing 
location.  *PTRANS*  requires  an  OBJECT  (whose  location  is  changed) 
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and  a  SOURCE  and  GOAL  'ocation.  The  DIRECTIVE  case  provides  slots 


for  thpse  locations,  and  is  symbolized: 

0  i - >  <G0AL> 


I-— <  <  SOURCE  > 


"John  goes  to  the  store" 

D 

♦JOHN*  <*>  *PTRANS*  < - *J0HN* 


0  I— — >  tSTOPE* 
-I 


An  ACTOR-ACT  relationship,  together  with  all  the  cases  required 
by  the  ACT,  is  called  an  EVENT. 


(2)  STATEs  and  STATE -CHANGES 

Some  of  the  information  stored  in  a  memory  and  communicated  in 
language  is  not  represented  as  EVENTs,  but  as  STATEs.  The  notation 
used  in  C.O.  for  such  information  is: 

VAL 

<CCNCEPT>  <ssb>  <ATTRIBUTE>  - -  <VALUE> 

For  example,  "Fred  has  the  bock"  is  represented  as 

VAL 

*B0C<* ' <s==>  *P0SS*  - *FRED* 


A  subset  of  tne  ATTRIBUTES  used  in  C.O.  are  SCALES.  When  the 
ATTRIBUTE  of  a  STATE  relation  is  a  SCALE,  the  VALUE  will  be  an 
integer  representing  a  point  on  the  SCALE.  The  only  SCALEs  referred 
to  in  this  paper  are  *HEALTH*  {physical  health),  *J0Y*  (mental 
pleasure),  and  *ANGER*  (emotional  anger), 

"Socrates  is  dead" 

VAL 

♦SOCRATES*  <5555  xHEALTH*  - -  (-10) 
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B i I i  is  happu 


VAL 

*BILL*  <se5>  *JOY*  < -  (+3) 


In  other  cases,  changes  in  state  must  be  represented.  The 
STATE-CHANGE  notation  is: 


VAL 

| - >  < AT  TR I BU TE>  < - <neu-VALUE> 

<C0NCEPT>  <*sb|  VAL 

j - <  <ATTR1BUTE>  ♦ - <old--VALUE> 


or,  a! ternat i velu. 


VAL 


j - >  <ATTRIBUTE>  < - <neu-VALUE> 

/ - j 

<C0NCEPT>  | - 

\ - j  VAL 

j - <  <ATTRIBUTE>  - <ola-VALUE> 

Commonly  only  the  terminal  state  (ATTRIBUTE  +  neu-VALUE)  of  a 
STATE-CHANGE  relation  is  known,  and  we  will  not  bother  putting 
anything  in  the  initial  state  slot. 

"Socarates  dies" 


VAL 

J - >  *HEALTH*  «- - (-10) 

*S0CRATE5*  <==5 l 

I . < 


When  the  change  of  state  is  along  a  scale,  it  is  common  that 
neither  t?-e  precise  initial  nor  termina  ate  is  known,  but  only 
the  direction,  and  perhaps  amount,  ot  <-  ye.  A  STATE-CHANGE  can  be 
modified  by  an  INCrement  to  show  this: 

"Truman's  condition  deteriorates" 


j — — >  ^HEALTH* 
*TRUf1AN*  <sb=| 

t  I— — <  *HEALTH* 
I  NCI 
(-S) 
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EVENTs,  STATEs,  and  STATE-CHANGEs  are  a!)  types  of 
relationships  which  are  termed  ‘conceptualizations’. 


(3)  CAUSAL 9  and  CONJUNCTIONS 

Two  types  of  causal  relationship  will  be  used  in  examples.  The 

first  is  a  relation  in  which  the  occurrence  of  an  ANTECEDENT 

conceptualization  causes  a  RESULT  conceptualization; 

<ANTECEU:'.N!>  /  \ 

/  \  (the  causal  relation  symbol  ||| 

111  wii!  sometimes  be  written 

<RESULT>  <=  } 


An  example  of  the  use  of  the  causal  is: 


"John  killed  nary" 

♦JOHN*  <===>  *Q0* 
/  \ 


*(1ARY*  <ss==| 


VAL 

->  *HEALTH*  ♦ -  (-18) 


(*00*  i$  a  *di'  -y’  ACT  used  to  hoicf  the  place  o»  some  actual,  but 
unknown,  ACT  and  its  required  cases.) 

The  other  causal  relationship  provided  for  is  the  CAN-CAUSE 
relation: 

< ANTECEDENT >  /c\ 

/c\  (the  causal  relation  symbol  ||| 

Ml  wiM  sometimes  be  wri  t  ten 

<RE3ULT>  <=C  ) 

This  relation  indicates  that  the  occurrence  of  the  ANTECEDENT 
conceptualization  could  cause  the  RESULT  conceptualization,  but  does 
not  indicate  the  actual  occurrence  of  either. 


IS 


Mary  likes  to  eat  chocolate 


0 

♦MARY*  <=  =  *>*1NGEST*  ^CHOCOLATE* 
/c\ 


III  I - >  *JQY* 

♦MARY*  <  5  ?  2  = | 

t  j - — <  *J0Y* 

INC  | 

(+::) 


Both  CAUSAL  r  ‘aticnships  are  themselves  conceptualizations. 

Furthermore,  any  two  conceptualirat ions  can  be  joined  oy  the  symbol 

*/V  to  form  a  CONJUNCTION,  which  is  also  a  conceptualization. 

<conceptuai izaticn> 

/\ 

<conceptual  '■  z:;t  ion> 


(A)  Mental  ACTs 

Many  English  verbs  --  tell,  remember,  teach,  read  —  involve 
the  transfer  of  information.  Conceptual  primitives  for  representing 
these  meanings  are  discussed  in  (S3.  In  this  report  we  shall  use 
only  one  ‘mental'  ACT,  *MTRAN$*.  Tnis  act  reguires  a  new  CASE,  the 
MENTAL-OBJECT  (MOBJECT).  An  MOBJECT  must  itself  be  some 
conceptualization.  *M TRANS*  a'so  requires  the  °ECIPIENT  CASE,  with 
the  DONOR  and  RECIPIENT  being  ‘mental  locations.’  In  this  paper  we 
shall  limit  mental  locations  to  'conscious  processors'  (*CP**  and 
‘long-term  memories'  (*LTM*)  of  human  beir.gs,  and  physical  objects 
which  in  some  sense  serve  as  information  stores  (books,  televisions, 

.  .  The  notation  for  an  EVENT  using  *MTRANS*  is: 

H  R  |— - >  <RECIP1ENT> 

<ACT0R>  *M TRANS*  — <M0BJECT^ . | 

|— -<  <D0NCR> 

i£ 


♦MTRANS*  is  an  abstract  ACT  which  indicates  the  transfer  of  the 
information  contained  in  the  ttOBJECT  frcm  the  DONOR  to  the  RECIPIENT. 
"The  professor  tells  Bob  that  Socrates  is  dead" 


PART 

fl  R  I - >  *CP*  < - jicBOB* 

♦PROFESSOR*  <=>**>  sflTRANS*  < - #  - - 1  PART 

t  | - <  *CP*  < - ^PROFESSOR* 

I - i 

l  VAL 

*S0CRATES*  ^as3>  *HEALTH*  < -  (-18) 


PART 

(The  notation  *CD*  • - *B0B*  indicates  the  cmsc'ous  processor  of 

the  individual  *B0B*.  Unen  conceptual izat ions  are  embedded  in  other 
conceptual  izat  icns,  a  A  will  often  be  used  as  a  'piace  hoide1-'  and 
will  be  connected  to  the  main  relational  link  of  the  embedded 
con  ■  ep*ual  i  rot  ion, ) 

Mental  locations  can  also  t i *  I  the  < VALUE >  slot  cf  STATE 
relations  which  have  as  their  <AfTRIBUTE>  *MICC*  (Mental -LOCation1 . 
The  <C0NCfPT>  in  such  relations  must  be  an  entire  conceptualization. 
For  example: 


VAI.  PART 

U  <*e*>  *fiLQC*  * - *LTM*  < - *FRE0* 

T 


♦JOHN*  <=*=>  *D0* 
/  \ 


II!  I - >  *KEAuTH* 

-(.MARY*  <3SE£| 


VAL 


(-13) 


i 


-< 


represents  the  meaning  of  "Fred  believes  that  John  killed  Mary. 
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(5)  I.IME_s  and  other  nod  1  f  i  ca t  i ons 

Still  to  be  accounted  for  is  the  concept  of  the  time  of 

occurrence  of  an  event,  which  usually  is  reflected  by  verbal  tensing 

in  language.  tIARGIE  deals  only  with  points  in  time,  not  intervals. 

The  symbols  IT1.  T2,  T3,  .  .  .1  will  be  used  for  times,  and  drawn 

with  pointers  to  some  conceptual  link: 

T1 

i 

< ACTOR >  <===>  <ACT> 


The  special  symbol  *N3U*  repres'nis  the  ‘current*  time  —  i.e., 

h 

the  time  of  an  utterance  cr,  more  exr^tly,  tse  time  of  creation  of  a 

conceptualisation.  TIME  relations  will  be  shown  on  a  time  line,  left 

representing  PAST;  right,  FUTURE. 

_ _ (indicates  the  relations 

f  t ’  l  '  T1  <  T2  <  #NQU* 

T1  TC  *N'JU*  when*  '<’  means  ‘BEFORE’  ) 


In  the . i mp I ementat ion,  every  EVENT,  STATE,  and  STATE-CHANGE  has 
a  TIME  associated  with  it.  In  our  diagrams,  houever,  TIME  will  be 
left  out  unless  it  is  relevant  to  the  point  being  discussed. 

Another  modification  of  EVEN’Ts  is  LGCation.  It  will  be 
represented  simply  as  a  modification  on  the  main  link  of  the  EVCN'i : 

local  ion 
|  LOG 
i 

<ACT0R>  <=<==>  <ACT> 
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'Swimming'  would  be  represented  by: 

♦WATER* 

hoc 

i  0  0  I— -> 

X  <««»>  *PTRANS*  * - X  * —  | 

I - < 

Other  modifications  of  conceptualizations  permit  negation  and 
'questioning',  but  these  will  not  be  needed  for  understanding  tl.'s 
paper.  One  that  is  needed  is  the  MODE  'CANNOT'  which  can  modify  an 
EVENT,  and  is  symbolized  by  a  0  on  the  <*'••->. 

♦JOHN*  <**«=>  *DQ* 

/  \  PART 

III  fl  R  | — >  *CP*  ♦ - -  *HARY* 

♦NARY*  <-= .>  *f1TRANS*  - - *C0NCEP7S*  — — | 

0  i  — >  *B0DK* 

is  the  representation  provided  for  "John  prevented  Hary  from  reading 
the  bock.'1  ^CONCEPTS*  is  a  'dummy'  flOBJECT •  it  represents  ‘some 
unspecified  conceptual  information.') 

4-rj  conceptualization  may  be  modified  by  a  FOCUS  reiation. 
FOCUS  always  specifics  one  particular  slot  in  a  conceptualization, 
such  as  the  ACTOR  of  the  RESULT.  FOCUS  will  not  be  noted  in  our 
diagrams:  while  it  is  anticipated  that  the  memory  model  will  find 
uses  for  FOCUS,  it  is  currently  used  only  by  the  generation  routine 
to  choose  between  words  like  "give"  and  "receive". 

(6)  Lonceutua I  nom i na I s 


The  reader  may  have  wondered  about  the  use  of  units  *J0HN*. 
♦BOOK*,  etc.,  in  conceptua I i zat ions.  C.D.  has  provided  a  great  deal 


of  analysis  of  verbs  and  relations  found  in  language,  but  little 
analu  is  of  concrete  and  abstract  nominais.  The  current  program  does 
not  deal  with  words  like  "happiness"  and  "involvement",  but  is 
limited  to  nouns  which  name  hysical  objects  arid  people.  The  unit 
♦JOHN*  in  a  conceptualization  is  a  pointer  to  a  memory  node,  at  uhich 
are  pointers  to  all  conceptual izat ions  involving  *J0HN*,  as  well  as 
such  conceptual  information  as 

IHUIIAN  *J0HN*1  and  (HALE  *J0HN*) 

The  relation  most  used  by  the  paraphrasing  system,  however,  is 

(ENGL1SH-NAME  *J0HN*  JOHN) 

In  Conceptual  Dependency,  these  object -naming  units  are  termed 
‘PP's  (picture  producers).  Considerable  work  must  stil!  be  done  on 
defining  the  precise  nature  of  these  units,  both  theoretically  and 
computationally.  It  is  expected  that  future  versions  of  MARGIE  will 
include  extremely  different  handling  of  nominal  references. 
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111. 


ANALYSIS  OF  NATURAL  LANGUAGE  SENTENCES 


The  analyzer  described  belou  is  one  that  is  conceptually 
oriented  not  on'y  in  the  output  it  produces,  but  also  in  the  kinds  of 
processes  it  uses  to  achieve  its  ansuers.  Its  primary  task  is  not  to 
discover  the  syntactic  relat iooships  in  a  sentence,  but  to  discover 
what  that  sentence  is  communicating.  Syntactic  relationships  arc 
used  to  help  this  process.  This  distinguishes  the  analyzer  fro  ■ 
previous  attempts,  such  as  Woods’  parser  till,  where  the  semantics  was 
needed  to  help  buiid  the  syntactic  structures. 

For  the  analyzer,  the  job  of  discovering  uhat  a  sentence  is 
communicating  means  discovering  uhat  Conceptual  Dependency  network 
should  be  generated  from  that  sentence.  One  source  of  information 
used  to  do  this  is  a  simple  description  of  certain  relationships 
between  words.  But  more  important  than  such  patterns  between  word 
types  are  the  passive  features  and  active  expectations  that  are 
associated  with  each  word  in  a  language.  These  expectations  look  for 
certain  events,  certain  features  or  structures,  and  if  these  things 
are  found  then  certain  actions  are  performed. 

This  emphasis  on  uords  rather  than  on  syntactic  structures,  on 
content  rather  than  form,  is  in  keeping  with  the  general  philosophy 
of  Conceptual  Dependency  and  in  contrast  to  previous  linguistic  and 
computational  linguistic  work.  Ue  are  interested  not  in  syntactic 
structures  but  in  those  processes  that  allou  people  to  communicate 
their  thoughts  using  language. 

The  features  of  a  word  are  facts  associated  either  with  that 
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word  itself  or  with  the  concept  referred  to  by  that  word.  That 
"John"  is  a  proper  name  is  a  fact  about  the  word  "John."  That  "John" 
is  a  male  human  is  a  fact  about  the  concept  referred  to  by  the  word 
"John".  Features  are  represented  in  the  system  in  the  C.O.  notation 
described  in  section  il.  They  are  not  special  flags  or  marks  built 
specifically  for  the  analyzer,  and  though  they  are  used  primarily  by 
tne  analyzer  .  they  are  still  pieces  of  world  knowledge  and  are 
represented  like  other  pieces  of  world  knowledge. 

Uh'le  the  features  are  described  with  primitives  and 
relationships  \hat  are  general1’'  used  in  representing  information, 
the  expectations  are  described  with  functions  and  flows  of  control 
that  are  oriented  more  towards  language  processing.  The  basic 
control  structure,  involving  a  set  of  conditions  plus  a  set  of 
actions  to  be  performed  if  the  conditions  occur,  is  a  reasonable 
mechanism  for  many  other  memory  processes.  Charniak  [1]  uses  a 
similar  device  to  describe  the  way  sentences  tie  themselves  together 
in  children’s  stories. 

The  functions  that  specify  these  conations  and  actions  are 
ones  that  have  been  found  useful  for  analysis.  As  our  knowledge  of 
memory  processes  increases,  some  will  remain  as  they  are  and  others 
will  be  generalized  to  do  more  than  language  processing.  The 
functions  that  have  been  developed  fall  into  several  groups. 

CONCEPTUAL  DEPENDENCY  GRAPH  MANIPULATORS 

These  functions  create,  and  change  internal  counterparts  of 


conceptual  dependency  representations.  Graph  locations,  which  can  be 
fully  specified  by  strings  of  conceptual  role  markers  such  as  "the 
actor  of  the  caused  event",  are  holders  of  information.  That  is,  the 
graph  is  both  the  final  analysis  result  and  also  the  source  of  many 
of  the  expectations  that  are  made  while  analysis  is  going  on. 

One  function  then  just  takes  a  string  of  role  markers,  e.g, 

"  (<s  actor)"  ,  and  returns  the  conceptual  piece  found  at  the  er.d  of 
that  path.  Another  function  follows  such  a  path  and  ►.  ts  in  a 
conceptual  piece.  The  first  function  is  called  CHOiCE  and  the  latter 
CHOOSE.  Both  of  these  functions  work  with  a  conceptualization. 

There  exist  two  related  conceptualization  builders,  REPLACE  and 
IM8E0.  REPLACE  replaces  the  current  ct.  eptual  graph  (which  may  be 
empty)  with  a  new  one,  perhaps  built  from  all  or  part  of  the  old. 

Tnis  is  called  mainly  when  the  verb  found  in  an  utterance  provides  a 
conceptual  network  tying  together  the  other  elements  in  the  sentence, 
or  when  some  word,  like  "again"  ,  tells  the  analyzer  that  the 

• 

conceptual  network  from  the  verb  is  part  of  some  other  network. 

IMBED  doesn’t  change  the  conceptual  graph  as  such  but  affects 
how  *he  above  functions  beh3ve.  Basically  when  IMBED  is  called  with 
a  string  of  role  markers,  it  causes  the  conceptualization  referenced 
by  CHOICE.  CHOOSE,  and  REPLACE  to  be  moved  to  the  conceptual  piece 
referenced  by  that  string  of  markers.  5uppose  the  analyzer  had  so 
far  built  a  network  involving  the  communication  of  a  causal 
conceptualization,  e.g..  "advise"  which  is  the  communication  of  the 
belief  iiat  if  the  person  being  told  does  something  he  will  be 


hoppier  for  it.  Now  IMBED  would  be  called  with  the  argument 
"  iMOBJECT  CONI"  to  reset  the  conceptualization  to  be  the  action,  in 
the  communicated  idea,  which  would  cause  pleasure.  Any  further  work 
done  by  CHOOSE,  CHOICE,  and  REPLACE  would  be  in  building  up  this 
action.  There  is  of  course  a  functior  comp'ementary  to  IMBED  called 
RESET_ALL  which  resets  the  conceptualization  to  be  the  one  which 
IM3E0  was  called  upon.  At  the  moment  there  is  no  stacking  of  these 
embeddincis  and  a  disinclination  to  do  so.  Stacking  is  a  mechanism 
that  can  be  programmed  in  fairly  well-defined  ways  and  it  has  been 
the  basis  o;  many  programs  for  operating  on  data  bases.  However  its 
psychological  validity  is  questionable.  At  best  functions  th3t 
operate  recursively  on  trees  are  convenient  ways  of  simulating  come 
human  mental  processes.  In  the  analyzer  however  recursion  's  not  a 
basic  mechanism.  Hence  if  the  anaiyrof-  IMBEDs  more  than  once  it  wi  I 
be  able  to  reset  only  tc  the  most  recent  embedding  or  else  to  the 
outermost  level  of  the  conceptualization.  Such  an  app~oach  is 
related  to  the  representation  we  have  chosen.  Had  cur  system  been 
based  on  graphs  of  a  more  mathematical  nature,  with  a  few  priraitfves 
and  lets  of  trees  to  represent  everything,  then  embedding  would  be 
occurring  constantly  and  the  natural  way  to  work  with  these  trees 
would  be  with  recursive  routines.  However  Conceptual  dependency  is 
oriented  about  Structures  where  closely  reiated  elements  of  a 
conceptualization  appear  together  at  the  sane  level,  where  a 
processor  doesn't  have  to  keep  looking  up  and  down  a  tree  for 
information.  When  the  focus  of  manipulations  moves  up  or  dc  m  a 


level  in  this  kind  of  format  it  means  something  significant,  and  can 
be  expected  to  take  more  effort,  and  thus  be  less  likely  to  be  as 
simple  a  mechanism  as  recursive  stacking. 

In  talking  about  functions  that  add  conceptual  pieces  to  the 
graph  one  point  should  be  marie  about  these  pieces.  Uhiie  on  printed 
output  these  graphs  look  l,ke  the  i inear  version  of  C.D.  graphs, 
there  <s  one  extra  feature  about  them  which  doesn't  show,  flany  times 
a  itual iration  will  have  some  piece  appearing  in  several  places 

in  aph.  The  simplest  example  is  with  give”  where  we  have  an 
ATRANSing  with  an  icent'ty  between  the  actor  and  the  donor.  Uhen 
specifying  conceptual  pieces  to  REPLACE  we  can  enforce  this  identity 
to  the  extent  that  the  same  graph  is  pointed  to  in  be th  places.  Not 
only  does  it  become  obvious  to  o*her  programs,  in  inference  and  in 
generation,  when  two  elements  are  meant  to  be  the  same,  but  with 
respect  to  the  builci;ng  of  these  structures,  it  gives  the  result  that 
any  changes  mad?  to  an  element  shou  up  in  all  of  its  occurrences 
automatically.  Although  this  is  only  a  small  part  of  it,  this 
ability  to  do  explicit  references  indicates  a  representation  that  can 
hand'e  the  results  of  more  complex  reference  determination  from  the 
memorq  processor. 

As  we  shail  see,  often  the  verb  will  explicitly  provide  REPLACE 
and  CHOOSE  with  the  conceptual  pieces  that  it  needs.  However  there 
are  also  tim?5  L.hen  there  are  significant  conceptual  structures 
coming  from  other  words  in  the  sentence-  Por  example,  in  "John  gave 
Mary  a  headache.  '  "a  headache”  is  the  name  of  a  conceptual  structure 


25 


involving  the  feeling  of  pain,  and  the  analyzer  needs  to  incorporate 
this  structure  into  one  that  says  "John  caused  Clary  to  feel  pain  in 
her  head."  Hence  there  also  exists  a  routine,  called  UTILIZE,  that 
takes  words  that  refer  to  structures  and  turns  them  into  forms  for 
incorporat ion  with  REPLACE. 

SYNTACTIC  STRUCTURE  flANIPULATORS 

Another  s  or  functions  used  is  needed  to  operate  on  the 
syntactic  structure  cf  a  sentence.  The  description  of  these 
functions  ui  I  i  be  scmernat  brief.  They  have  not  been  the  main  focus 
of  our  effort.  This  is  because  much  work  has  already  been  done  on 
syntactic  analysis,  dost  other  approaches,  computational  and 
linguistic  and  even  psychological,  have  been  concerned  with  uhat 
could  be  obtained  using  just  syntax,  until  it  became  necessary  to  add 
on  a  littie  semantics  to  help  out.  The  approach  here  is  the  exact 
opposite,  to  see  what  can  be  done  from  tne  conceptual  side  and 
include  syntactics  when  they  become  important.  The  first  form  of  the 
analyzer  didn't  even  have  word  order.  Not  even  taking  into  account 
all  trie  arguments  that  have  been  made  in  favor  of  semantics  over 
syntactics,  it  would  seem  that  this  attack  o~  the  problem  has 
interest  in  that  it  does  relegate  syntax  to  a  truiy  subordinate 
ros i t i on. 

The  syntactics  used  by  the  analyzer  are  quite  simple.  This  is 
partly  because  less  time  has  been  spent  on  them  and  partly  because 
the  existence  of  a  conceptual  network  means  the  syntax  doesn't  have 
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to  carry  the  semantic  load  that  it  does  in  a  syntactically  based 
system. 

There  a re  three  surface  cases  used,  SUBJ,  OBJ,  and  RECiP,  uhi~h 
save  places  for  items  until  they  can  be  given  conceptual  roles  to 
play.  These  roles  are  primarily  determined  by  word  order,  with  a 
secondary  distinction  between  humans  and  objects,  so  that  RECIP  is 
generally  a  human,  if  it  occurs  at  all.  Uhen  embedding  occurs  these 
cases  are  saved  as  well,  and  reset  by  RESET_ALL  with  the  same 
comments  about  stacking  applying.  Further,  CHOICE  and  CHG05E  both 
know  hou  to  handle  these  cases,  and  the  analyser  can  add  and  extract 
information  from  them  just  as  with  the  conceptualization. 

These  word  order  cases  are  supplemented  by  the  use  of 
prepositional  markers.  The  analyzer  usually  knows  what  relationship 
a  preposition  is  expressing  either  frot  what  has  already  been 
understood  or  from  the  nature  of  the  obje.-  of  the  preposition.  The 
verb,  which  plays  a  central  role  in  this  system,  usually  does  most  of 
the  work  in  giving  an  expected  meaning  to  the  use  of  a  preposition. 
Still,  the  analyzer  needs  to  save  the  fact  that  such  and  such  item 
was  governed  by  such  and  such  preposition,  particularity  when 
prepositions  introduce  a  sentence  ("By  the  car  was  a...")  and  when 
backup  routines  are  colled. 

There  is  another  place  where  simple  syntactic  action  occurs: 
while  wa  fing  for  the  accumulation  of  enough  information  to  make  a 
conceptual  represents  ion.  This  happens  in  the  building  of  noun 
phrases.  Starting  with  the  recognition  of  an  article  or  adjective, 
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words  as  they  are  brought  in  are  not  converted  into  a  unified 
conceptualization  until  some  hing  is  seen  that  indicates  the  noun 
phrase  is  ended  The  end  of  the  sentence,  a  verb,  or  the  start  of  a 
new  noun  phrase  are  some  of  these  signals.  Knowing  uhat  the  main 
item  is  that  is  being  modified  by  the  previous  string  of  adjectives 
and  nouns  the  analyzer  can  make  a  conceptual  whole.  But  many 
adjectives  used  commonly  like  ‘‘short"  or  "sweet"  cannot  be  said  to 
have  meaning  until  they  nave  something  to  modify.  Granted  there  may 
be  things  that  seem  common  between  "a  short  stick1'  and  "a  short 
pause",  betueen  "a  sweet  candy"  and  "a  sweet  voice",  but  these  common 
elements  are  tco  vague  to  be  sufficient  to  be  definitions  for  the 
adjectives.  That  is,  given  some  such  unifying  theme,  we  still 
couldn’t  predict  reliably  what  modification  the  adjective  meant  with 
many  nouns.  There  are  times  when  we  generalize  word  usages,  when 
metaphors  are  involved,  but  for  the  moment  we  are  concerned  with  the 
common,  ingrained  uses  of  words.  Hence  ue  find  ourselves  here  with 
fairly  ambiguous  words,  i.e.  the  adjectives,  and  the  major  source  of 
information  on  what  to  do  with  these  words  coming  last.  There  is 
also  the  complicating  factor  of  noun  pairs,  such  as  "kitchen  table" 
and  "police  state".  There  exists  a  program  by  Sylvia  Weber 
Russel  I  (SJ  that  handles  a  number  of  these,  and  eventually  it  ui I i  be 
tied  in  with  the  analyzer. 

There  are,  then,  two  functions  for  handling  noun  phrases.  One 
takes  neu  words  and  coMectc  them  into  a  simple  list,  waiting  for  the 
end  of  the  phrase.  The  other  is  called  when  the  phrase  end  is  noted 


and  converts  this  list  into  a  normal  conceptual  structure.  This  new 
structure  is  then  returned  as  the  meaning  of  the  noun  phrase  and 
behaves  as  a  unit  for  such  functions  as  CHOOSE  and  FEATURE. 

MEMORY  INTERFACE  FUNCTIONS 

FEATURE  brings  us  to  another  open-ended  set  of  functions,  which 
interrogate  the  memory’s  world  knowledge  for  information  about 
things.  These  things  may  be  either  words  cr  concepts.  FEATURE  i9 
the  only  memory  i nterrogat ion  function  currently  used  by  the  analyzer 
(other  possibilities  are  the  class  III  predicates  used  by  generator, 
as  discussed  in  section  IV).  It  takes  as  one  argument  either  a  word 
or  a  simple  conceptual  piece  consisting  of  a  PP  plus  modifiying 
conceptualizations  and  as  the  other  argument  some  property  value, 
such  as  “human"  or  "proper"  (as  in  proper  nouns).  These  property 
values  belong  to  uhat  are  called  contrast  sets,  such  as  "(human, 
animal,  physical  object)".  These  contrast  sets  are  needed  oecause 
there  are  often  times  when  the  analysis  depends  on  which  element  of 
the  set  a  particular  word  or  concept  is  associated  with.  It  is 
important  to  note  that  these  contrast  sets  are  not  hierarchical,  at 
least  to  3  great  degree.  Although  Mary  being  a  human  implies  that 
she  is  an  animal  whicn  implies  she  ;s  a  physical  object,  the  way  in 
which  "Mary"  is  hand'ed  in  language  differs  depending  on  whether  she 
is  no  more  than  an  object  or  no  more  than  an  animal,  FEATURE  is  a 
veru  simple  information  retrieval  function,  A  particular  complex  of 
features  has  been  chosen  for  some  reason  and  FEATURE  is  used  to  find 
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out  what  else  is  true  in  this  complex.  Thus,  by  a  criterion  of 
commonness,  "John"  is  chosen  as  referring  to  "J0HN1"  which  is  "the 
man  called  John"  sense  of  "John".  FEATURE  then  tells  us  that  "J0HN1" 
is  a  man,  and  that  an  English  name  is  involved. 

There  are,  as  mentioned,  other  functions  in  the  analyzer,  but 
they  are  subservient  to  the  ones  discussed  above.  Only  one  more 
piece  of  the  analyzer  needs  to  be  described  before  some  examples  are 
given.  This  piece  is  the  monitor,  or  supervisor,  the  piece  that 
takes  definitions  of  words  in  terms  of  these  functions  and  executes 
their  instructions.  This  monitor  is,  and  is  meant  to  be,  ,ery 
simple,  its  job  is  to  do  bookkeeping  on  the  following  variables. 

SENTENCE  -  this  is  the  utterance  being  analyzed.  It  is 
constant  thrcugh  the  analysis. 

WORD  -  this  is  the  current  word  in  the  sentence  that  is  being 
looked  at.  Normally  UORD  is  set  to  each  successive  word  in  SENTENCE, 
going  from  left  to  right. 

SENSE  7  this  is  the  current  sense,  or  meaning,  that  is  being 
worked  with,  it  is  usually  either  the  meaning  of  UORD  or  of  the  noun 
phrase  containing  WORD.  A  sense  of  a  word  is  a  name  for  a  set  of 
requests  and  features.  Features  are  simply  conceptualizations. 
Requests  are  analysis  instructions. 

REQUESTS  -  this  is  a  list  of  requests  which  is  unordered  uith 
one  exception.  The  monitor  continually  rechecks  this  list  to  see  if 
changes  to  UORD,  SENSE,  CONCEPT,  or  REQUESTS  itself  have  caused  any 
of  the  requests  to  become  applicable.  Requests  are  representations 
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nf  the  expectations  a  word  sets  up  of  situations  that  might  occur  and 
actions  to  take  in  those  cases.  The  unordered  rechecking  is  meant  to 
be  a  simulation  of  a  parallel  control  structure  where  each  request 
looks  to  see  if  it  should  do  anything,  independent  of  the  other 
requests.  The  only  exception  to  this  concerns  those  requests  that 
are  activated  when  some  phrase  or  clause  ends.  For  example,  in  "John 
wanted  Mary...,"  the  analyzer  assumes  that  "Mary"  is  beginning  a 
clause  involving  something  involving  Mary  that  John  wants.  If 
instead  that  is  the  uhole  sentence  and  nothing  more  has  been  found 
out  when  the  end  of  the  sentence  is  reached  then  a  def  jit  assumption 
is  made  that  John  wants  Mary  to  come  to  him.  These  requests  that  are 
called  by  the  end  of  something  are  always  placed  at  the  end  of  the 
request  list.  This  is  equivalent  to  cone i dering  them  as  independent 
processes  thai,  in  being  called  by  the  absence  rather  than  the 
presence  of  something,  wait  to  make  sure  that  "more  real"  requests 
have  had  their  say. 

ANSUER  -  this  is  the  concept'  *1  representation  of  SENTENCE  that 
the  analyzer  is  building.  It  is  tne  variable  whose  value  is  returned 
by  the  analyzer. 

CONCEFT  -  this  is  a  pointer  to  either  ANSUER  or  to  some 
subconceptual izat  on  in  ANSUER.  This  points  to  the  place  where  the 
building  activitiy  is  going  on  at  any  point  in  the  analysis.  Thus  it 
starts  off  the  same  as  ANSWER  but  uhen  an  embedded  conceptualization 
is  being  built  it  points  to  that  instead. 

Attached  to  each  word  that  appears  in  SENTENCE  are  one  or  more 
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senses,  that  is,  labels  of  sets  of  features  anu  requests.  Requests 
are  of  the  form  "(  TEST  ACTION  FLAG)".  TEST  and  ACTION  are  the 
crucial  elements  of  a  request.  TEST  is  a  (Lisp)  predicate  and  ACTION 
is  a  (Lisp)  function,  both  built  from  Lisp  'unctions  and  those 
functions  that  have  been  described  above.  Uhen  UGRD  changes,  the 
monitor  first  checks  REQUESTS  for  instructions,  adds  eny  requests 
attached  to  WORD,  then  finds  the  current  sense  for  WORD  (setting 
SENSE  equal  tc  it),  then  checks  REQUESTS  again,  then  adds  the 
requests  that  are  part  of  SENSE  to  REQUESTS  and  steps  WORD  along  in 
SENTENCE.  In  gene-al,  T£ST  predicates  make  reference  only  to  CONCEPT 
and  the  feature  aepects  of  UORO  and  SENSE.  Checking  a  request  means 
evaluating  the  TEST.  If  TEST  is  not  true  nothing  happens  and  the 
monitor  goes  on  to  the  next  request.  If  TEST  is  true,  then  ACTION  is 
executed  and  FLAG  is  altered.  FLAG  is  a  bookkeeping  mark.  When  it 
is  NIL  it  me3ns  the  request  has  not  been  used  yet,  while  T  means  that 
the  request  has  already  been  used. 

REQUESTS  is  changed  by  either  the  monitor  or  an  ACTION.  In  the 
former  case  uoros  and  senses  have  their  requests  added  to  the  list. 

In  the  latter  case,  either  the  function  If'BED  which  introduces 
clauses,  or  a  simpler  one  for  starting  prepositional  phrases,  saves 
the  current  REQUESTS  and  replaces  it  with  another  set.  RtSET_ALL 
restores  REQUESTS  to  the  original  set  when  it  is  called.  IHBED  thus 
works  with  three  information  sets:  the  conceptualization  being  built, 
the  syntactic  structure  being  built,  and  the  expectations  being  made. 

One  other  operation  that  the  monitor  performs  is  to  initialize 
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REQUESTS  to  3  request  which  looks  for  any  noun  phrase  that  will  be 
the  subject.  This  is  done  whenever  a  new  sentence  is  begun. 

The  best  way  to  descriLe  how  these  functions  are  put  together 
to  form  requests  is  by  examples.  The  first  example  will  be 
straightforward,  the  second  will  show  how  words,  like  "giv<*’!,  can 
work  to  tie  together  the  contents  of  other  words,  and  the  last 
example  will  show  how  words,  like  "by",  can  work  to  tie  together 
large  conceptual  structures. 

The  first  example  is  "John  advised  Nary  to  drink  the  wine."  The 
requests  attached  to  the  words  in  the  example  are: 

advised  -  (T  (CHOOSE  TIME  (BEFORE  (NEUJIME)  (CHOICE  TIME)  X)  NIL) 
drink  -  ((NEEDJIHE)  (CHOOSE  TINE  (CHOICE  TINE)  )  NIL): 

Nost  of  these  functions  haven't  been  discussed  and  the  requests 
are  here  for  completeness.  Basically  if  words  have  requests  at  all 
they  are  ones  like  time  choices.  The  above  requests  say  that 
"advised"  always  refers  to  a  past  event  and  "drink",  if  a  time  is 
needed,  refers  to  a  present  one.  However  the  "to"  that  is  set  by 
"advise"  sets  " (NEE0_TINE) "  to  false,  so  the  time  will  be  untouched 
by  "drink".  Past,  presen.  and  future  mean  before,  during  and  after 
the  time  of  the  surrounding  conceptualization,  respectively. 

The  senses  for  the  words  in  this  sentence  are,  for  programming 
convenience,  usually  the  same  as  the  woi  d  with  a  numeric  suffix 
attached.  The  requests  attached  to  the  senses  that  appear  ir,  this 
sentence  are: 


J0HN1 .  NARY1;  none: 


A0V1SE1:  (T  (REPLACE  CONCEPT  (QUOTE  ((ACTOR  (A  SUBJ)  <->  ( *M TRANS*) 
TO  (*CP*  PART  (A  REC IP)  REF  (*THE*) ) 

FROM  (*CP*  PART  (A  SUBJ)  REF  (*THE*) )  NOBJECT 

((COM  (NIL  TINE  (>)  NODE  (NIL))  <sC 

((ACTOR  (A  REC  IP)  -:=>T  t*JOY*)  ^s>F  (*JOV*))  INC  (2) 

TINE  (,-  NOBJECT  CGN  TINE)  NODE  (NIL)))))  FOCUS  ((ACTOR)) 
NODE  (NIL)  TINE  (NIL))))  NIL) 


This  request  produces  a  concept. .ai  form  equivalent  to 

l-->  (  REC  IP  )  1 
M  R  | 

(  SUBJ  )  *M TRANS*  (A)  ~-| 

i— <  (  SUBJ  ) 


where  A  is  the  foi  Soiling  conceptual  form: 

(  ) 

/c\ 

Ilf  I— >  *JOY* 

(  REC  IP  )  <»»| 

t  I--S  *J0Y* 

I  NC| 

U2) 


that  is,  someone  is  being  told  that  doing  something  nil  I  please  him. 

(T  (DEFPROP  TO  T30  CURRENT)  NIL) 

This  request  makes  3  prediction  aoout  future  use  of  "to". 

((FEATURE  SENSE  (QUOTE  HUNAN))  (CHOOSE  RECIP  SENSE)  NIL) 
Thin  request  says  that  the  next  human  is  the  person  receiving 
the  NTRANSing.  it  could  be  written  so  as  to  put  the  human  into  the 
graph  directly  but  it  is  stored  in  RECIP  just  in  case  a  parallel 
syntactic  structure  is  needed. 

TO0:  (T  (PROG  NIL  (INBED  (NOBJECT  CON!  ((SUBJ  CHOICE  RECIP) 

(TiNE  AFTER  (NEW  TINE)  (CHOICE  TINE)  X))  ( (BREAKPOINT) 
(RESET  ALL)  NIL)  F  (SETQ  USE  TINE  NIL  ))  NIL) 


This  says  that  when  "to"  is  found  start  the  conceptual  building 
at  CON  in  the  HQBJECT,  set  the  time  to  be  after  the  communication, 
set  the  SU8J  of  the  infinitive  veH:>  to  be  the  person  being  advised, 
and  set  REQUESTS  to  one  looking  for  the  end  of  the  clause. 

DRiNKl:  (T  (REPLACE  CONCEPT  (QUOTE  ((ACTOR  (#  SUBJ!  <=>  (*INGEST*) 
OBJECT  (#  OBJ)  TO  (*!NS10E*  PART  (#  S'JBJ))  FROM 
(*M0UTH*  PART  (U  SUBJ))  )  NODE  (NIL)  TIME  (NIL)!))  NIL) 

graphical ly  this  is: 

PART 

|— >  *  INSIDE*  - — (SUBJ) 

0  0  I 

f  SUBJ  )  <*>  fcINGEST*  «—  (  OBJ)  -~\  PART 

I— <  *M0UTH*  4. - (SUBJ/ 

another  request  uithDRINKl  is: 

((FEATURE  SENSE  (QUOTE  PP) )  (CHOOSE  0«J  SENSE)  NIL) 

This  request  says  that  the  next  object  it  finds  is  the  thing 
being  drunk. 

The  features  of  the  words  are: 

John,  Mary  -  in  a  contrast  set  cal len  *U0RDTYPE*  they  have  the 
value  *NAME*.  which  means  they  don't  require  an  article: 

JOHN! ,  MARY1  -  both  have  the  feature  "HUMAN",  and  JGHN1  also 
has  the  feature  "MALE"  versus  MARYl's  "FEMALE"  but  that  is  not  needed 
here.  Both  also  have  the  feature  "PP"; 

U1NE1  -  has  the  feature  "PP",  which  is  the  oniy  one  needed 

here. 

The  analysis  of  the  sentence  "John  advised  Mary  to  drink  the 
wine"  proceeds  simply  enough.  The  initial  request  looking  for  a 
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subject  (SU8J)  is  satisfied  by  "John'1.  "Advised"  and  'ADVISE1" 
satisfy  no  requests  but  add  their  own  tc  the  set,  and  further  change 
CONCEPT  i  xl  hence  ANSUER)  to  a  cor>_<.ptual  skeleton  of  the  fITRANS 
action,  "flaru"  and  "flrtRYl"  satisfy  the  request  looking  for  a 
recipient  of  the  fITRANS.  "To"  an,  "TUi"  move  CGNCEPT  to  point  to  the 
ceptua I i cat i on  being  fITRANSed  and  reset  REQUESTS.  "John"  and 
V  satisfy  the  request  nou  being  made  for  u  SUBJ.  "Drink"  and 
"PFiiNKi"  put  the  conceptual  skeleton  for  a  drinking  act. on  ir.to  head 
of  the  causal  in  the  fIQBJECT  slot,  "lime"  ar,d  "UINE1"  satisfy  the 
request  looking  for  an  GBJ  of  the  drinking.  The  end  of  the  sentence 
causes  REQUESTS  and  the  syntactic  cases  and  CONCEPT  to  be  returned  to 
the  values  they  had  be  tore  “to”  uas  errountered.  REQUESTS  is  checked 
again,  and  then  the  analysis  is  ever.  The  value  of  ANSUER  (in 
qraphic  form)  is: 

0 

♦NARY*  *iNGEST*  < —  *UINE* 
P  |._>  *NARY*  fl  /c\ 

♦JOHN*  <  =  >  *HTRANS*  <  — |  - 

|  jil  |—  >  *J0Y* 

♦JOHN*  *MARY*  •: s  =  =  i s  | 

*  | — <  *J0Y* 

INC! 

1+2) 

Because  of  space,  the  fact  that  the  recipient  case  of  fITRANS 
involves  the  Conscious  Processors  (CPs)  of  the  people,  not  the  people 
themselves,  is  rot  shoun  in  this  diagram.  Also,  the  times  have  been 
left  out. 

The  secoro  example  is  "June,  gave  Nary  a  beating."  i.e  focus 
here  is  on  the  way  in  uhich  "give"  is  used  mainly  to  full  together 
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This  soy?  t Ha t  when  "to"  is  found  star t  the  cortceptua1  building 
at  CON  in  tie  HOG  'ECT,  set  the  time  to  be  after  the  communication, 
set  the  SUBJ  of  the  infinitive  verb  to  be  the  person  being  advised, 
and  set  REQUESTS  to  one  looking  for  the  end  of  the  clause. 

0RINK1:  (T  (REPLACE  CONCEPT  (QUOTE  ((ACTOR  iff  SUBJ)  <«>  (#INCEST*) 
OBJECT  iff  OBJ)  TO  UI.N3I0E*  PART  iff  SUBJ))  FROn 
<*M0L'TH*  PART  iff  SUBJ) !  )  (10DE  (NIL)  TIME  (NiL))))  NIL) 

graph i ca I  I y  this  is: 

PART 

I— >  ♦INSIDE*  « - (SUBJ) 

0  D  | 

<  SL'bJ  )  <->  *  INGEST*  —  (  OBJ)  «--- 1  PART 

i— <  *J10UTH*  - (SUBJ) 

another  request  with  0RINK1  is: 

((FEATURE  SENSE  (QUOTE  PP) )  (CHOOSE  OBJ  SENSE)  NiL) 

This  request  says  that  the  next  object  it  finds  is  the  thing 
be  i  no  cfrunk. 

The  features  of  the  words  are: 

John.  Nary  -  in  a  contrast  set  called  *U0RDTYPE*  they  have  the 
value  *NAME*.  which  means  they  don’t  require  an  article; 

JOHfll ,  MARY  1  -  both  have  toe  feature  "HUMAN",  and  J0HN1  also 
has  (he  feature  "MALE”  versus  MARYl’s  "FEMALE"  but  that  is  not  needed 
hei  e.  Both  also  have  the  feature  "PP“: 

Hi  NET  -  has  the  feature  "PP”,  wnlch  is  the  only  or  needed 

here. 

The  analysis  of  the  sentence  "John  advised  Mary  to  drink  the 
uine"  proceeds  simply  enough.  The  initial  request  looking  for  a 


the  other  elements  of  the  sentence  to  yield  a  meaning  paraphrasab I e 
in  English  as  "John  beat  Mary."  This  is  3  very  common  use  of  "give" 
and  (here  are  many  other  words  ‘hat  can  function  the  same  way  For 
e. ample  "John  took  a  walk"  means  the  same  as  "John  walked  for  a 
while,"  and  John  got  Mary  a  job"  is  related  to  “John  gave  Mary  a 
job."  in  all  these  examples  the  object  is  the  name  of  some  action  or 
situation,  and  "give",  "get”  and  "take"  take  these  situations  and 
apply  them  in  specific  ways  to  the  other  elements  they  govern. 

Soma  of  the  requests  associated  with  "GiVEi",  which  is  the 
sense  of  "give"  that  handles  the  above  example,  plus  ones  like  “John 
gave  Mary  a  headache,"  and  “John  gave  Mary  a  book,  are  like  the  ones 
described  in  the  previous  example.  Thus: 

GIVEI:  (T  (OEFPROP  TO  TO.  CURRENT  I  NIL) 

((FEATURE  SENSE  HUMAN)  (CHOOSE  RECIP  SENSE)  NIL) 

((FEATURE  SENSE  P08J1  (CHOOSE  OBJ  SENSE)  NIL) 

(T  (REPLACE  CONCEPT  (QUOTE  ((ACTOR  iff  SU6J)  <->  UATRANS*) 

TO  (ff  RECiPt  FROM  iti  SUB-1)  OBJECT  [U  OBJ))  FOCUS  ((ACTOR!) 
TIME  (NIL)  MODE  (NIL!)))  NIL) 

graphically  this  last  is: 

0  R  I— >  (  RECIP  ) 

(  SUBJ  I  to  *ATRANS*  —  (  OBJ  I  — -| 

i  .  (  n  ip  i  | 

j - - v  \  juuj  i 

but  the  important  'equest  tor  the  examole  (which  is  paraphrased 
here  in  English  ‘or  readability)  is  this  one: 
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TEST  if  RcNSE  is  a  conceptualization  of  »he  form 
one!  •-=»  do 


/  \ 

lii 

one 2  <=>  state 

(that  is,  someone  puts  someone  into  a  state)  then 
ACTION  REPLACE  CONCEPT  (with  UTILIZE)  with  SENSE  uhere 
onel  is  replaced  by  SUBJ  and  one2  try  RECIP 

The  sense  of  ’’beating"  that  is  assumed  here  ha9  the  following 


meaning  (graphically): 


ONE 


one: 


0  PART  0  I— >  0NE2 

*PR0PEL*  *•-  (*HAND*  0NE1!  *— | 

/  \  I— <  ONE  1 

I I  |  - . REPEATEDLY 

| | |  MANNER 

III  VAL  PART 

<*ss>  *PHYSC0NT*  * - «HAN0v  * —  ONEI 


This  is  the  represent)  ion  for  repeated  hitting.  The  analysis 
returned  for  "John  gave  Mary  a  beating"  will  look  like  this  except 
that  "JOHNi"  will  appear  everywhere  that  "ONE!"  does  and  "MARY1" 
everywhere  that  "0NE2"  does. 

T  e  next  example  is  "John  killed  Mary  by  choking  Mary."  It 
contrasts  ui  th  t1  i.^-t  example  in  the  kind  of  manipulation  that 
occurs.  In  "John  gave  'laru  a  beating,"  the  meaning  of  "give”  was  a 
set  of  actions,  not  some  conceptual  piece.  The  actions  built  a 
conceptualization  from  the  other  words  in  the  sentence.  In  this 
example,  "John  killed  Mary  by  choking  Mary,"  the  word  "by"  ties 
together  two  large  conceptual  pieces.  "John  kilied  Mary"  and  "John 
choked  Mary,"  "By"  asks  questions  about  conceptualizations  rather 
than  about  words  and  differs  from  "give"  ;n  that  way.  To  kiil 
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someone  means  to  do  something  to  make  that  someone  die.  To  choke 
someone  means  to  grasp  his  (or  her)  neck  causing  him  to  be  unable  to 
breathe. 

"BYl",  the  name  assigned  to  this  use  of  "by",  has  the  following 
job  to  do.  It  has  to  tie  together  two  conceptualizations,  making  one 
"instrumental"  in  the  occurrence  of  the  other.  If  the  two  actions 
are  simple  EVENTs,  then  the  main  act  has  the  other  in  its 
I NSTR'JflENTa I  case.  (Any  ACT  can  take  an  INSTHUtlENTal  case,  which 
must  a. ways  b-*  filled  by  an  entire  EVENT.  This  INSTRUflENT  further 
specifies  t*  e  nature  of  the  ACT  on  which  it  is  dependent.)  If  the 

main  action  is  i  causal  and  the  causing  action  is  unspecified 

(graphically  there  is  a  dummy  “do"  written  for  the  act)  then  the 
secondary  action  is  helping  to  specify  this  causing  act.  If  the 
secondary  action  is  a  simple  act  ther  it  is  a  straightforward 
replacement  of  this  act  in  the  unspecified  slot.  This  happens  in 
"John  angered  flary  by  giving  Bill  the  book."  If  the  secondary  action 
is  a  causal  itself  then  the  result  event  of  this  secondary  action  is 

in  turn  the  antecedent  event  of  the  main  action.  This  happens  in  our 

examp  I e. 

"BY!"  a ! sc  has  a  few  other  duties,  like  preparing  the  analuzor 
for  an  "ing"  form  of  a  verb,  and  making  the  current  subject  the  new 
subject  of  the  "by"  clause.  An  English  paraphrase  of  the  request  set 
for  "BYl"  (which  involves  too  many  of  those  format  functions 
mentioned  before  to  be  usefully  written  out  here)  reads  like  this: 
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TEST  if  CONCEPi  is  3  causal  then 

ACTION  hEPlACE  CONCEPT  ,.-itn  ((CON  (NIL)  a  CONCEPT))  ,  that  is, 
form  a  space  for  the  secondary  concept, 
and  also  acid  the  following  request: 

TEST  until  the  secondary  concept  has  been  found  then 
ACTION  if  the  old  concept  had  an  unspecified  causing  action 
then  if  the  secondary  is  also  a  causal  then 

replace  the  unspecified  action  with  the 
result  event  of  the  secondary,  else 
CONCEPT  is  REPLACEd  bu  the  whole  secondary  action 
causing  the  result  event  of  the  main  action, 
else  CONCEPT  is  left  the  conjunction  of  two  events 
and  also  (IMBED  CON  HSUBJ  CHOICE  SUBJM 

((BREAKPOINT)  (RESET. ALL)  NIL)) 

The  last  action,  the  IMBEO,  says  that  from  this  point  on 
conceptual  building  will  be  done  in  the  CON  space  just  attached  with 
the  REPLACE,  and  it  also  says  that  REQUESTS  will  be  set  to  look  for 
the  end  of  the  clause. 

TEST  if  CONCEPT  is  a  simple  act  then 

ACTION  (If1BEC  INST  ( (SUBJ  CHOICE  SU8J) )  ((BREAKPOINT)  (RESET_ALL)  NIL)) 

This  request  p  icaoi.  in  »  simple  instrumental  use  of 
"by",  such  as  "John  gave  nary  toe  book  by  handing  it  to  her."  The 
last  request  looks  for  an  “ing"  form  cf  a  verb  to  follou  the  "by". 

TEST  if  L';RD  is  a  word  with  suffi*  INC 

ACTION  then  give  it  the  Keanir.^  of  the  root 

When  this  sense  of  "by",  which  is  set  by  "KiLLl",  is  used  the 
final  analysis  of  "John  killed  Mary  by  choking  Mary"  15: 
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0  PART 

0 

0 

♦JOHN* 

<»»*> 

♦GRASP* 

—  *NFCK*  *T1ARY*  *NARY* 

<»**> 

♦  INGEST*  *A1R* 

/  \ 

/  \ 

111 

/\ 

ill 

ill 

0 

III 

|— >  *HEALTH* 

♦NARY* 

<•.=*=> 

♦INGEST* 

*AiR* 

I  VAL  t 

0 

♦NARY* 

OBMES 

1  -10 

1 

— <  *health* 

VAL  t 
X 


We  have  seen  some  examples  non  of  the  core  of  the  conceptual 
analyzer.  We  have  seen  the  kinds  of  functions  that  are  used  and  the 
kinds  of  results  that  are  constructed.  The  discussion  has  been  brief 
a^d  the  analyzer  described  is  tar  from  complete.  Houever  it  can  be 
seen  that  the  basic  philosophy  of  Conceptual  Oeoendency  has  been 
continued  here.  Not  only  has  the  stress  been  on  a  conceptual  rather 
than  just  a  ianguage  oriented  semantic  output,  but  the  same 
criterion  of  naturalness  that  Ie3ds  to  one  representation  rather  than 
another  has  been  used  in  deciding  what  decisions  cause  what  steps  in 
the  analysis  process.  The  assumption  implicit  in  the  requests  for 
‘give’,  that  humans  are  consistently  treated  differently  from 
physical  objects,  is  such  a  decision.  The  control  structure  itself 
uas  worked  out  from  an  assumption  that  natural  language  processing 
does  not  involve  global  routines  that  are  based  on  syntactic 
structures,  but  rather  such  processing  is  carried  out  by  short 
programs  and  expectations  associated  with  the  words  of  the  language. 
The  idea  of  sets  of  requests  was  a  straight  forward  implementation  of 
this  assumption. 
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in  the  last  fet.<  examples,  the  how  of  analysis  will  be  omitted 


and  only  the  input  and  output  will  be  shown. 


JOHN  AGGRAVATED  MARY  BY  GIVING  BILL  THE  BOOK 
Tinea  :  uval  *T*n 
Tinei  :  ( (BEFORE  T I M08  X)) 

T I M02  :  ((BEFORE  T IM81  X)) 

((CON  ((ACTOR  ( J0HN1)  <«>  (*ATRANS*)  TO  (BILL1)  FROM  (J0HN1)  OBJECT 
( BOOK  1  REF  (*THE*) ) )  FOCUS  ((ACTOR))  HOOE  (NIL)  TRIE  (TIf102) )  <s 
((ACTOR  (MARY1 )  <s>T  UANGEfi*)  <a>F  UANGER*) )  V I  HE  ( T I C101 )  INC  (2)))) 


This  is  the  internal  representat ion  of  the  fol toning  graph  structure: 


0  R  |-->  *BILL* 

♦JOHN*  <«>  *ATRANS*  *BO0K*  • — | 

/  \  i--<  *J0HN* 

II! 

Ill  |-->  *ANG£R* 

♦NARY*  <ss€e j 

*  |~<  *ANGER* 

INC  | 

(+2) 

The  next  two  examples  show  how  much  concepts  with  different 
features  can  affect  the-analysis  of  that  sentence.  The  analyzer 
assumes  that  when  someone  wants  someone  else,  he  wants  that  person  to 
come  to  him,  but  when  he  wants  some  physical  object,  he  wants  to  have 
that  object. 

JOHN  WANTS  MARY 
Tinea  ;  hval  *t*d 
Tinei  :  (  (AFTER  TR100  X)! 

Tif102  :  ((AFTER  TIH00  X)) 

((CON  ((CON  ((ACTOR  (flARYl )  <•>  UPTRANS*)  OBJECT  (MARvl)  TO  (J0HN1) 
FROM  (Nil))  TINE  (TR102))  <?C  ((ACTOR  (JOHN!)  <s>T  (*J0Y*)  <b>F  (*J0Y*)) 
INC  (2)  TINE  (T|n01i}))  < s >  (*T.L0C*  VAL  ,*LTn*  PART  (JOHN! )  REF  (*THE*) 
)))  NODE  (NIL)  FOCUS  ((<s>  VAL  PART!)  TIME  (TIf180) ) 


A2 


VAL  PART 

U  <sss>  *NLOC*  » —  *LTN*  i —  *JOHN* 
t 


I  G  D  I— >  *JOHN* 

j  *NARY*  <=  ==>  *PTRANS*  » —  *NARY*  « — j 
!  /c\  --< 


III  |->  *JOY* 
*J0HN*<25SF | 

t  I— <  *JOY* 
INC  | 

(+2) 


JOHN  UANTS  A  BOOK 
TIN00  :  ((VAL  *T*) ) 

T 1 1101  :  ((AFTER  T 1 H00  X> ) 

T I n02  :  ((AFTER  TH100  X)) 

((CON  ((CON  ((ACTOR  (*ONEl«)  <->  U A TRANS*)  OBJECT  (BOOK1  REF  (*A*)) 

TO  (JOHN1)  FROM  (*ONE*) )  TINE  (TIN02))  <h  {(ACTOR  (JOHN!)  <«>T  (*JOY*) 
<s>F  (*JOY*))  INC  (2)  TINE  (Tif101 )  > ) )  <s>  (*NLOC*  VAL  ULTN*  PART 
(JOHN1)  REF  (*THE*)  > ) )  NODE  (NIL)  FOCUS  ((<s>  VAL  PART))  TINE  (TIN00)) 

VAL  PART 

U  <ss*>  *NLOC*  * —  *LTN*  » —  *JOHN* 

t  • 


0  R  I— >  *JOHN* 

*ONE*  <  =  =>=>  *A TRANS*  •---  *BOOK*  < — | 

/c\  |--<  *ONE* 

III 

- -i|| 

III  I— >  *JOY* 

*J0HN*<M«l| 

*  j  *  JO  Y * 

INC  | 

U2) 
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IV.  PRODUCTION  OF  NATURAL  LANGUAGE  SENTENCES 

FROM  CONCEPTUAL  REPRESENTATIONS 

Uhpn  the  analysis  of  a  source  sentence  has  been  completed, 
control  passes  to  the  memory  model,  which  integrates  the  conceptual 
structure  produced  by  the  analyzer  into  the  existing  memory.  Neither 
the  source  sentence  nor  the  words  comprising  it  are  used  in  the 
remainder  of  the  paraphrase  production,  Memory  processes  are  not 
described  in  this  report,  but  are  discussed  in  [33.  Ue  can  think  of 
the  integration  process  as  one  of  tying  references  to  already  known 
items  to  internal  nodes  which  represent  these  items,  and  of  creating 
neu  nodes  to  represent  new  items  and  concepts.  Thus  an  associative 
memory  is  maintained,  in  which  a  node  representing  the  individual 
‘John  Smith'  has  pointers  emanating  from  it  to  every 
conceptual 'nation  in  which  ‘John  Smith'  plays  a  part. 

The  problem  remaining  is  to  taki  the  conceptual  representation 
which  the  analyzer  produced  for  the  input  and  find  an  English 
sentence  whi ch- expresses  the  meaning  represented.  The  words  and 
syntax  of  the  original  sentence  have  been  discarded.  Thus  the  fact 
that  the  conceptualization  was  produced  from  an  English  sentence 
which  was  the  input  to  3  paraphrase  program  is  not  relevant  *o  this 
problem.  The  conceptual  i  zat  ion  could  just  as  well  h-ive  come  from  a 
German  i npu t  in  a  machine  translation  task,  through  3  chair,  of 
deductions  in  a  question  answerer,  or  through  some  information 
gathering  motivation  in  .. o  interviewing  program.  Uhat  is  required 
then  is  a  routine  which  can  take  an  arbitrary  conceptual 


44 


representation  and  realize  it  in  English  --  i.e.,  a  general 
Conceptua I -Eng I i sh  generation  program. 

Both  linguists  and  computer  scientists  have  designed  systems 
for  language  generation.  These  can  generally  be  classified  33  either 
random  or  directed.  Random  systems  [12]  attempt  to  produce 
grammatical  English  sentences,  starting  only  with  the  goal  "produce  a 
grammatical  English  sentence,"  Such  systems  can  be  used  to  test 
syntactic  tneories.  and  could  be  used  to  test  semantic  theories  as 
well  if  i;he  goal  Here  "grammatical  AMD  meaningful". 

Directed  systems  posit  some  underlying  structure  and  have  as  a 
goal  "produce  a  sentence  having  the  specified  underlying  structure." 
Our  goal,  to  "produce  grammatical  English  sentences  with  a  specified 
meaning",  certainly  rails  under  thi s  paradigm.  Unfortunately,  the 
language  free  a  set  of  conceptual  representat ion  renders  approaches 
which  have  been  previously  tried  inapplicable  to  our  task.  Some  of 
the  directed  approaches  13J  assume  a  syntactic  underlying  structure. 
MARGIE  does  not  Know  the  syntax  of  the  desired  output.  Others  assume 
a  semantic  structure  [1R1 :  these  specify  the  desired  meaning,  but  do 
so  in  terms  of  linguistic  units  (word  senses)  not  present  *>n 
conceptualizations.  Thus  MARGIE  requires  a  new  approach  to 
genera t ion. 

Tne  task  of  producing  an  English  sentence  from  a  language  free 
meaning  structure  is  indeed  very  complex,  but  several  subtasks  may  be 
i den  t i f i ed: 
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i)  Uords  must  be  chosen  to  use  in  the  sentence. 

ii)  The  words  must  be  tied  together  by  English  syntax  relations  (or 
relations  from  which  the  syntax  can  be  produced). 

iii)  The  words  and  relations  must  be  linearized  to  form  an  English 
sentence. 

Although  it  mau  net  be  necessary  to  organize  these  subtasks 
sequentially,  it  seems  that  if  (i)  and  (ii)  could  be  accomplished, 
then  (iii)  could  make  use  of  the  generative  mechanisms  devised  for 
directed  non-conceptual  generators. 

Uhat  I1ARGIE  does  in  fact  is  to  break  up  generation  into  two 
distinct  phases.  First  a  ‘syntax  network*  is  created  (steps  (i)  and 
(ii)  occurring  in  parallel),  then  a  grammar  products  an  English 
sentence  from  the  network.  The  remainder  of  this  section  is  devoted 
to  describing  these  syntax  networks  and  how  conceptual  and  linguistic 
knowledge  are  used  in  their  formation. 

The  reader  may  wonder  how  MARGIE,  having  thrown  away  the  source 
sentence,  can  be  sure  that  the  English  realization  it  arrives  at  will 
be  a  true  paraphrase  rather  than  the  ‘identity’  paraphrase.  The 
answer  is  that  it  cannot:  the  generator  actually  produces  several 
different  realizations  from  the  conceptualization.  The  original 
sentence  may  we  I  1  be  among  these. 

First  the  process  bu  which  a  single  English  realization  is 
obtained  will  be  described:  this  description  will  be  augmented  later 
to  explain  the  production  cf  multiple  realizations  ( i . e . , 
paraphrases)  from  a  given  conceptualization. 
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Uni  ike  the  conceptual  representation,  the  syntax  network  is 
very  much  language  depende;  both  the  tokens  and  relations  in  the 
network  are  English  specific.  Structurally  the  network  i3  identical 
to  the  semantic  networks  of  Simmons  [10],  Unlike  Simmons’  nets, 
however,  these  syntax  netuorks  uill  connect  lexical  entries  with 
syntactic  relations.  The  syntactic  network  from  which 

"John  threw  the  ball  at  Mary" 


would  be  generated  is 
LEX 


->THR0W 

ACTSBJ 


LEX 


->i  g:  I- 


->  JOHN 


I  I. 


G1 


I  TENSE 


->PAS1  j 


LEX 


OBJ  |  i - >  BALL 

- >i  G3  |  DET 

I  _ !- . >THE 


_  PREP’ 

I  OBJ  |  |- . >  AT  _ 

- >|  G4  |  POBJ  !  |  LEX 

I _ | . -->|  G5  | - >f1ARY 


This  same  network  can  oe  more  concisely  written  as: 


LEX 

THROW 

G3: 

LEX 

BALL 

ACTSBJ 

G2 

DET 

THE 

OBJ 

G3 

1  OBJ 

G4 

G4: 

PREP 

AT 

TENSE 

PAST 

POBJ 

G5 

LEX 

JOHN 

05: 

LEX 

NARY 

The  elements  which  are  objects  of  LEX  relations  are  lexical 
entries.  The  lexical  entry  THROW  will  contain  only  morphological 
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information  such  as  PAST=THR£U.  THROW  is  NOT  a  word  sense  —  the 
same  lexical  entry  serves  for  "throw  a  boxing  match",  "throw  a 
tantrum",  and  "throw  a  ball". 

The  relations  may  seem  to  be  merely  renamings  of  the  relations 
used  by  Simmons  or  Fillmore  {21.  but  that  they  are  not  can  be  seen 
from  further  examples.  I0BJ  (indirect  object)  serves  Tar  the  "at 
dary"  in  the  above  example,  as  well  as  for  the  "to  flary"  in  "John 
gave  the  book  to  Mary"  and  the  "from  dary"  in  "John  bought  »he  book 
from  dary".  Semantic  systems  uouid  tend  to  break  down  these  10BJ 
relationships  into  SOURCE,  GOAL,  ana  other  relations. 

On  the  other  hand,  the  syntax  network  may  make  distinctions 
which  a  semantic  network  would  not.  Networks  for  the  two  sentences 

(1)  "John  wants  dary  to  sell  hint  her  Chevy" 

(2)  "John  hopes  flary  will  sell  him  her  Chevy" 

both  contain  an  embedded  structure  representing 

"dary  seil  John  dary's  Chevy;  tense=future  " 

This  embedded  structure  would  be  placed  in  the  same  relation  to 
‘want’  and  'hope'  by  most  semantic  models.  They  are  placed  in 
different  relations  in  our  syntactic  networks  because  of  the 
necessity  o?  performing  an  ‘  inf  I ni t i ve- i z i ng‘  transformation  in  (1) 
but  not  in  (2).  Such  syntactic  information  about  ‘want’  and  ‘ho|  e* 
will  not  be  processed  by  the  grammar  which  generates  from  the  syntax 
nets,  but  is  handled  by  the  routines  which  create  the  syntax  nets. 

(Note  that  hese  nets  could  bo  subjected  to  a  transformational 
process  as  are  the  syntax  trees  of  a  transformational  grammar.  This 
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Mould  result  the  production  of  paraphrases  of  the  sort  described 
as  ‘syntactic’  e.g.,  active  to  passive  voice  —  in  section  I.  No 
such  transformation, tl  process  is  incorporated  in  the  present 
program.  ) 


The  Frof'..»  l  ion  n  Sunt  fly.  Nets  fro.?  Conceptualizations 
To  produce  a  syntax  ret  from  a  conceptualization,  a  ’synthes' 
by  analysis'  process  is  undertaken.  The  conceptualization  is 
analyzed  to  detect  noteworthy  patterns  -n  the  conceptual  syntax  and 
noteuortny  relations  in  the  conceptual  semantics.  Uhile  there  are 
potentially  infinitely  man'  patterns  <_.d  -elationships  uhich  cculd 
be  detected,  only  a  finite,  and  relatively  small,  subset  of  these 
will  be  inU^es  ting  for  the  put  poses  of  generation  of  a  given 
language.  For  instance,  in  ge.nerat  ing  Enql  i  sh  from: 

PART 

0  D  | - >  *INSI0E*  « - *J0HN* 

(Cl  i  *JQHN*  <**=>  *INGEST*  * - - - i  PART 

| - <  *M0UTH*  - *J0HN* 

the  fact  that  *hILK*  is  a  FLUID  is  of  interest,  since  English  makes 
an  'EAT-QRINK'  o is, I  net  ion.  However,  in 

PAR', 

0  D  I - >  *INSIDE*  ♦ - *8EAR* 

(C2)  '♦BEAR*  <===>  *  INGEST*  — —  *F!SH*  - -  |  PART 

| - <  *fiOUTH*  - - +BEAR* 

ii  ;s  not  important  that  BEARs  are  ANiflALs  3rd  not  HUMANs.  However, 
to  generate  a  German  realization  of  (C2)  the  cl i s  action  is 
important,  since  German  makes  an  differentiation  uh;ch  English  does 
not,  (German  uses  the  verb  ‘ f,  essen’  to  describe  ez  ir  when  done  by 
an  animal,  bui  the  verb  ‘essen’  uhen  a  human  agent  is  involved.) 


Although  the  ract  that  MILK  is  a  FLUID  is  relevant  in  (Cl),  it 
is  irrelevant  in 

0  D| . >  *REFRIG* 

~3)  *J0HN*  <-*«>  *P TRANS*  -  *CUP*  < — j 

t  | - < 

CrN'T  1 
♦MILK* 

which  could  be  realized  as  "John  put  a  cup  of  milk  in  the 
refrigerator."  (Of  course,  some  representat ion  of  the  spacial 
relation  'in’,  cs  opposed  to  ’nn,  near.  .  .  must  be  added  to 
(C3> .  ) 

Thus  the  relevance  for  generation  of  a  conceptual  pattern  or 
relation  19  dependent •• 

A)  on  the  language  chosen  (examples  (Cl)  and  (C2)  ),  and 

B)  on  the  conceptual  context  in  which  it  occurs  (examples  (Cl)  and 
(C3)  ) 

HARGIE’s  language  generator,  BABEL  (Better  Analytic  Easts  for 
Encoding  in  Language),  organizes  this  sort  of  linguistic  knowledge  as 
a  set  of  discrimination  nets.  A  discrimination  net  is  a  data 
structure  consisting  of  3  binary  tree  uhoae  non-terminal  nodes  are 
associated  with  predicates  anci  uhose  terminal  nodes  are  associated 
witn  'response'  information.  In  operation,  a  discrimination  net  is 
applied  to  a  ’stimulus’--  in  this  case,  a  conceptualization.  Ths 
precLcates  in  the  ‘ree  take  the  conceptualization  as  a  parameter. 

The  algorithm  for  applying  the  discrimination  net  can  be  stated  as 
f  0 i lows: 


S0 


1.  Set  CURRENT  NODE  to  the  root  node  of  the  net. 

2.  If  CURRENT -N00E  is  a  terminal,  90  to  step  6. 

3.  evaluate  the  predicate  at  CURRENT-NODE. 

4.  If  the  value  is  TRUE,  set  CURRENT-NODE  to  its  ‘right-hand*  son 

and  go  to  step  2. 

5.  If  the  value  is  FALSE,  set  CURRENT-NODE  to  its  ‘left-hand’  son 
and  go  to  step  2. 

S.  Return  the  response  associated  with  CURRENT-NODE. 

A  portion  of  a  discrimination  net  uhich  would  find  the  response 
‘G1VE1*  for  the  stimulus: 

0  R  | - <  *MARY* 

(C4)  *J0HN*  <«««>  *ATRANS*  — —  *B00K*  — — j 

I - <  *J0HN* 

is  shoun  in  figure  1.  <ln  drawing  discrimination  nets,  root  nodes 
util  be  assigned  inaex  ‘1’:  sons  of  a  node  with  index  N  will  be 
assigned  indices  2N,  2N+-1  r 

Figure  2  trace?  the  application  of  tHe  net  of  figure  1  to 
stimulus  1C4},  following  the  discrimination  met  algorithm  given 
above. 


Si 


|  |  ACTOR 

111 

I _  |  RECIPIENT 

/  \ 

/  \ 

/  \ 

NODE  |  | 

I  2  | 

NIL  | _ | 

/  \ 

/  \ 

/  \ 


|  did  the  RECIPIENT  possess  the 
|  5  |  OBJECT  at  any  time  prior  to  the 

I _ I  TIME  of  the  conceptualization 

/  \ 

/  \ 

/  \ 


FOCUS  |  | 

I  10  I 
ACTOR  |  I 

/  \ 

/  \ 

/  \ 


I  21  |  (GI VEl » 


FIGURE  1 


CURRENT 

NODE 


(CN) 

FREOICATE 

VALUE 

ACTION 

1 

are  the  ACTOR  and  RECIPIENT  identical 

FALSE 

CN-2*CN 

is  there  no  MODE  associated  u'(th  the 
stimulus 

TRUE 

CN»-2*CN+] 

5 

did  the  recipient  possess  the  object 

FALSE 

CN.  2*CN 

at  any  time  prior  to  the  TIRE  of 
the  stimulus 


10  is  the  ACTOR  to  be  ‘focussed’  TRUE  CN*-2*CN+] 

21  none  (terminal  node) 


return  GI V 


The  predicates  within  the  discrimination  nets  may  be  separated 
into  three  distinct  classes.  Class  1  predicates  are  those  which 
perform  pattern  matching  within  the  stimulus  conceptualization. 

These  include  tests  for  the  identity  of  two  conceptual  fields,  e.g., 
the  predicate  ACTOR  *  RECIPIENT  at  node  1  of  figure  1.  Other 
predicates  in  this  class  test  for  the  presence  of  particular 
conceptual  elements  in  ‘he  stimulus  —  e.g.,  is  there  a  8  modifying 
the  RESULT  of  a  conceptual ization?  --  or  test  the  structure  of  a 
stimulus  —  e.g..  is  it  of  the  form  EVENT -CAUSE-EVENT? 

Class  11  predicates  are  logically  unnecessary  but  are  included 
for  purposes  of  keeping  the  nets  compact.  They  allow  a  single  node 
iri  one  net  to  perform  an  entire  set  of  tests  from  the  same  or  a 
different  net.  An  example  will  clarify  the  idea  behind  this.  The 
English  verb  "to  breathe",  in  its  most  common  sense,  is  represented 
conceptual iy  as 

PART 

0  D  j - <  *1N51DE*  « - X 

X  <«*»?  *INCF.ST*  * - #A1R*  - - 1  PART 

| - <  *nQUTH*  - - X 

while  to  choke  (someone  ’  is  represented  as 

X  <===>  *GRASF*  < - *NECK* 


Y 


/  \ 
I  !  I 


<  =  *=*>  * INGEST* 
8 


t 

J  PART 

Y 

0 


PART 

0  i - >*1NSI0E*  v - Y 

*AIR* - i  PART 

| - <*r>0UTH*  - Y 


The  RESULT  in  this  representat i on  of  "choke"  is  just  the 
representation  of  "breathe"  modified  by  0.  Rather  than  repeat  the 
tests  necessary  for  recognizing  '8REATHE1'  on  the  path  leading  to 
'CH0KE1',  the  predicate 
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POT.HEAO  (  <*  )  -  BREATHE1 

is  evaluated  at  some  node  of  the  discrimination  net.  Evaluation  of 
this  predicate  consists  of  testing  whether  the  structure  found  in  the 
(<s)  slot  (i.e.,  the  RESULT)  of  the  stimulus  could,  if  used  as  a 
stimulus,  evoke  the  ‘response’  BREATHE 1  (i.e.,  whether  8REATHE1  is  a 
POTential  ‘HEAD’  of  a  syntax  net  for  this  structure).  The  exact 
relationship  between  the  'responses’  in  the  discrimination  nets  and 
the  production  of  the  syntax  nets  will  be  explained  shortly.  In  this 
case  the  savings  obtained  are  not  considerable,  since  ‘BREATHE1’  does 
not  require  a  large  set  of  tests  for  its  characterization.  In  other 
cases,  however  considerable  storage  savings  result  from  this  form  of 
recursion  in  the  discrimination  process.  The  price  paid  for  the 
savings  is,  of  course,  extra  processing  time,  since  the 
di  scr iminat  ion  net  uhich  recognizes  ‘BREaTHEI’  may  make  unnecessary 
tests  in  doing  so. 

Class  III  predicates  test  properties  which  are  ‘semantic’  in 
nature.  They  all  involve  interaction  with  the  memory  model.  It  was 
shown  earlier  that  the  fact  that  *f1ILK*  is  a  FLUID  is  important  to 
the  generator  in  certain  instances.  *M!LK*,  when  it  appears  in  a 
conceptualization  is  not  an  English  word,  but  3  pointer  to  a  node  in 
memory.  And  FLUID  is  NOT  a  property  shared  by  the  English  word 
"milk"  and  the  German  "Milch”,  etc.,  but  a  property  of  the  concept 
♦MILK*.  Thus  this  information  is  not  stored  as  linguistic 
information  in  a  lexicon,  but  is  stored  in  the  memory  3nd  accessed 
through  the  node  *f1ILK*. 


54 


In  addition  to  categorical  information  of  this  sort,  the  memory 


is  the  sole  repository  of  relational  information,  such  as  BEFORE- 

AFTER  time  relationships.  Uhen  a  conceptualization  is  passed  to  the 

generator,  such  relational  information  is  not  included  unless  it  is 

specifically  desired  that  it  be  expressed.  However,  linguistic 

choices  may  be  dependent  on  this  information.  For  example, 

T1  PART 

A  HR  | - >  *CP*  •- - *T1ARY* 

(C5)  *J0HN*  <***>  *HTRANS*  -  U  - - 1  PART 

t  I - <  *CP*  « - #J0HN* 


i 


t: 

i 

a  n 

*HARY*  <•=*=>  *HTRAN3*  < - *C0NCEPTS* 

/c\ 

ill 

. >; i I  i . — >  *joy* 


*f1ARY*  ! 


t  f  j - <  *J0Y* 

INC  |  | 

i  T3  ■ 

+2 


PART 

R  |-->*CP*  < -  *MARY* 

| — <  *B00K* 


can  he  real  I  sod  as 

"  John  (tell  +  tense]  Nary  she  [like  +  tense)  reading  the  book" 
However,  if  the  generator  finds  out  (by  asking  the  memory  model)  that 

1)  T1  is  p'ior  to  *N0U*,  and 

2)  T1  is  prior  to  T2. 

then  the  realization 

"John  advised  Nary  to  read  the  book" 
i s  possible.  If.  however,  T2  is  prior  to  Tl,  then  the  sentence 
"John  told  Mary  she  would  have  enjoyed  reading  the  bock" 


55 


may  be  generated,  but  the  use  of  "advise"  is  prohibited. 

Other  linguistic  choices  are  made  on  the  basis  cf  non- 

linguisttc  context.  Making  such  choices  involves  another  form  of 

interaction  between  the  generator  and  the  memory  model.  Given  our 

ear  I i er  example: 

T1 

;  OR  | - <  *MARY* 

(C4)  *J0HN*  <««=>  *ATRANS*  *BQQK*  - —  | 

! - <  *J0HN* 

t  t 
T1  *N0U* 

it  may  be  appropriate  to  produce 

S4)  "John  gave  the  book  to  Mary"  or 

S4')  "John  returned  the  book  to  Mary" 

The  decision  is  made  on  the  basis  of  the  context  existing  in 

the  memory  at  the  time  the  generation  takes  place.  In  this  case,  the 

generator  passes  to  memory  the  question: 

3  T0.  T£<T] ,  such  that: 

T0 

i  YAL 

*B00K*  <*b£>  ♦FOSS*  * -  *MARY* 

(where  Tl.  *BOOK*,  and  *MARY*  are  the  same  pointers  as  in  the 
stimulus  (C4)  ) 

i.e.,  was  there  a  time  previous  to  Tl  at  which  the  book  was  in  Mary’s 
possession?  If  memory  answers  "ues"  to  this  ouestion,  S4’  may  be 
generated;  otherwise,  S4  will  result. 

(There  is  a  problem  here  in  deciding  whether  memory  should 
really  try  to  'find'  the  requested  information,  or  take  the  much  more 
expensive  approach  of  trying  to  ‘prove’  it.  The  answer  is  dependent 

5S 


on  the  ansuer  to  the  question  of  how  much  inference  a  memory  model 
should  do  ‘on  the  fly’  III.  It  should  be  noted,  however,  that  a 
model  uhich  permitted  multiple  representations  for  a  given  meaning 
would  have  to  adopt  a  proof  procedure  here.) 

Finally,  situations  exist  in  uhich  linguistic  considerations 
require  access  to  deductive  capabilities  of  memory  as  well  as  its 
information  retrieval  capacity.  Consider  the  conceptualization: 


(CG) 


T1 

;  n  r 

♦JOHN*  <==■>  *I1TRANS*  ► —  tt  - - 

t 


PART 

>  *CP*  < - *HARY* 

PART 

<  *CP*  « - *J0HN* 


|  *J0HN*  <»«=>  *00* 

I  /  \ 

| . >|  i  |  | . ->  *HLALTH*  *- -  (-J0) 


Nary’s  husband  I  - 

\ - 1 

t  I- . < 

T3 


T  t  T  T  f 

T1  T2  T3  Ti  *N0U* 

This  can  be  realised  as 

SG  "John  told  Nary  that  he  was  going  to  kill  her  husband." 
A  reasonable  paraphrase  might  be 

SG’  "John  threatened  to  kill  Nary's  husband." 


But  one  can  imagine  circc  tances  in  uhich  S6’  would  be  a  very 
poor  realization  and  a  much  bett>  one  would  be 
SG’ ’  "John  promised  to  kill  Nary’s  husband." 
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In  order  to  choose  between  ‘tell’,  ‘threaten’,  and  ‘promise’ 
BABEL  uses  a  predicate  HEM-QUERY.  The  distinction  is  made  on  the 
basis  of  whether  the  MQBJECT  of  the  #MTRANS*  could  cause  the 
RECIPIENT  of  the  *MTRANS*  to  become  much  less  happy  (or  much 
happier).  A  conceptualization: 

T2 

i 

* 

♦JOHN*  <*==>  *00* 

/  \ 


l - 

Mary’ s  husband  |  - 

\ - 

t 

T3 

- -  tt 

/  c\ 


>  *HEALTH*  « - 


(-10) 


1 1 1  i - >  *JQY* 

/ - 1 

♦MARY*  |  — . 


i  t  I - <  *jov* 

INC  |  T3 


(X:  K<-3) 


is  formed,  and  if  it  can  be  proved  then  ‘threaten’  is  chosen.  On  the 
other  hand,  if  this  conceptualization  with  INCrement  (X:  X>-»-3)  on  the 
resulting  state-change  can  be  proved,  then  ‘promise’  may  be  selected. 

The  memory-inference  model  in  the  present  program  is  not 
capable  of  proving  relations  of  this  complexity  —  i.e.,  whether  an 
arbitrary  conceptualization  describes  something  uhich  could  please  or 
harm  a  particular  individual.  Such  theorem  proving  is  in  fact  beyond 
the  current  capacities  of  language  processing  systems.  Our  program 
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resorts  to  human  intervention  to  answer  such  questions;  a  conceptual 
structure  like  that  above  is  typed  out  at  the  console  when  the 
program  needs  the  information  and  a  human  informant  responds  TRUE  or 
FALSE. 

It  is  important  to  realise  that  sucn  a  capability  is  not 
specific  to  a  paraphrase  program,  nor  even  to  the  subtask  of  language 
generation  in  general.  A  psychiatric  interviewing  program,  for 
example,  would  very  likely  need  the  ability  to  analyze  what  was  said 
to  it  and  determine  if  it  was  ‘threatening’,  ‘hostile’,  etc.  The 
desire  to  perform  such  an  analysis  has  nothing  to  do  with  the 
program’s  expressing  in  English  the  fact  that  what  was  said  was  a 
threat.  Since  the  need  for  such  a  capacity  can  be  justified  on 
grounds  independent  of  generation,  no  unreasonable  assumption  is 
being  made  in  making  it  available  to  the  generator,  it  demonstrates 
one  interesting  interaction  between  'inguistic  knowledge  —  that 
English  provides  a  verb  "threaten"  to  describe  3n  informatio.i 
transfer  meeting  certain  conditions  —  and  non-l inguistic  capability 
--  the  ability  to  decide  whether  a  given  piece  of  information  has 
particular  implications  in  a  particular  context. 

It  is  interesting  to  note  how  small  changes  in  3ome  conceptual 
roles  may  have  large  effects  in  the  linguistic  realization  of 
conceptualizations.  The  time  relations  in  (C5)  were  one  example. 

But  not  only  the  time  relations  are  required  for  reading  (C5)  as 
'advise';  the  identity  of  the  several  instances  of  *I1ARY*  is  also 
necessary.  Supper ?  that  the  *f1ARY*  in  ‘he  STATE-CHANGE  of  (C5)  were 
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changed  to  *J0HN*  —  i.e.,  we  had  the  'meaning ’  "John  tell  Mary  that 
Nary  read  the  hock  can  cause  John  become  happier."  No  longer  can 
♦his  be  realised  as  advise,  regardless  of  time  relations.  But  if  the 
time  relations  necessary  to  get  ‘advise’  from  (C5I  still  hold,  the 
new  meaning  could  yield  the  reading  ‘reciuest’  or  ‘ask’: 

''John  requested  that  Mary  read  the  book" 

It  is  the  job  of  the  discrimination  nets,  employing  the  three  types 
of  predicates  provided,  to  make  the  subtle  distinctions  required  for 
the  selection  of  words. 

The  core  of  BABEL  is  a  collection  of  discrimination  nets 
utilizing  these  kinds  of  predicates.  Given  a  conceptualization,  it 
is  first  necessary  to  decide  which  nets  are  applicable.  Of  course, 
all  of  them  could  be  tried,  most  of  them  failing  to  find  any 
response.  Fcr  efficiency,  a  quick  structural  analysis  is  performed 
to  determine  the  net  of  applicable  nets.  For  example,  a  stimulus 
with  the  structure  EVENT-CAUSE-STATE  CHANGE  will  be  found  to  have  two 
relevant  nets,  EKC.  which  is  specifically  for  EVfNT-CAUSE- 
STATECHANGE  structures,  and  KAUS,  which  applies  to  all  CAUSAL 
structures. 

Each  of  the  discrimination  nets  found  is  applied  to  the 
stimulus  until  a  ‘response’  is  found.  If  all  trees  are  applied 
without  a  respon'-e  being  found,  BABEL  gives  up  trying  to  express  the 
conceptualization.  If  a  response  is  found,  it  will  be  a  unit  called 
a  CONCEXICON  pointer.  As  shoun  in  Figures  i  and  2  above,  the 
CONCEXiCCN  pointer  GiVEl  may  be  found  as 


a  response  to  the  stimulus: 


o  R  I - >  *f1ARY* 

(C4)  *JOHN*  <*>  *A TRANS#  -  *BOOK*  — —  | 

t  | - <  »tJOHN* 

REF  | 

INDEF 

HIVEl  is  a  pointer  to  an  entry  in  a  data  file  called  the  CONCEXICON 
(CONCEptua!  leXICON).  An  entry  in  this  file  has  three  fields: 

CONCEXICON  ENTRY 


|  LEXICAL  POINTER  |  FRAI1EU0RK  |  SPECIAL  ACTIONS 


The  SPECIAL  ACTIONS  field  is  usually  empty.  When  present,  it 
specifies  some  form  of  transformat ion  on  the  conceptualization.  No 
examples  of  SPECIAL  ACTIONS  nil  I  be  needed  for  our  examples.  The 
other  two  fields  are  amays  pres°nt.  The  lexical  pointer  is  a 
reference  to  an  entry  in  the  lexicon;  the  pointer  for  GIVE1  is  to  the 
lexical  entry  GIVE.  Concexicon  entries  correspond  closely  to  the 
usual  notion  of  word  senses,  so  many  concexicon  entries  may  refer  to 
a  single  lexical  entry.  The  concexicon  entries  FLY1  ("to  pilot  an 
aircraft"),  FLY2  ("to  travel  by  p'ane").  and  FLY3  ("to  move  through 
the  air")  all  point  to  the  lexical  entry  FLY. 

(Of  course,  the  three  senses  have  different,  although  related, 
conceptual  representat ions.  Skeletal  forms  of  each  are  shown  below. 
FLY1 

human  <  ===->  *00* 

/  \ 

|||  0  0  | - >  locat  ion2 

plane  <===>  *PTRANS*  « - plane  * - 1 

t  j - <  locaiionl 

|L0C 

*AIR* 


El 


FLV2 


♦AIR* 

|lOC 

I  0  D  | - >  locat ion2 

plane  <=*=>  ♦PTRANS^  - - plane  - - 1 

t  | - <  locationl 

ICON! 

human 


FLY3 

♦A!  Rt 
|  LOG 

i  0  D  | - >  locations 

X  <===>  ♦PTRANS*  X  - — | 

j - <  location! 

It  is  the  job  of  the  discrimination  nets  to  find  the  particular 
pattern  present  in  a  stimulus  and  return  the  appropriate  CONCEXICON 
pointer.) 

The  FRAMEUORK  of  the  concexicon  entry  consists  of  a  list  of 
FRANEs,  where  each  FRAME  has  three  fields: 

FRAME 

i  i  *  i  i 

|  SYNTAX  RELATION  |  FIELD  SPECIFICATION  |  SPECIAL  REQUIREMENTS  i 

I _ I _ , _ ! _ I. 

The  SYNTAX  RELATION  is  a  member  of  a  fixed  set  of  relations 

which  can  occur  in  the  syntax  nets.  These  include  ACTSSJ,  OBJ, 

GBJ2,  INQ-GBJ,  INF.  and  1NF2  mentioned  earlier.  A  FIELD 

SPECIFICATION  is  a  designation  of  a  substructure  of  a 

conceptualization.  It  consists  of  a  nst  ot  elements  from  tne  set 

f  ACTOR  OBJECT  TO  FROM  <=>  <*>  <s  <eC  CON 
VAL  PART  M0BJEC1  TiME  MODE  /\  <sT  <sF  I 

These  are  the  internal  names  used  by  the  system  to  refer  to  roles  in 

conceptual  relations.  Most  correspond  closely  to  the  names  used  in 

section  II;  the  less  obvious  ones  are: 

EG 


ACTOR  refers  to  the  <ACT0R>  in  EVENTs,  the  <C0NCEPT> 

in  STAlEs  and  STATE-CHANGEs 

CON  refers  to  the  <ANTECEDENT>  in  causa!  relations, 

the  first  conceptualization  of  conjunctive 
relations 


/\ 


<s , 


<sC 


rpfer  to  the  second  conceptualization  of  a 
-on,  .active  r  elation 

refers  to  the  ;RESULf>  of  the  corresponding  type 
ot  causal  relationship 


<*F,  <sT 


re<er  to  the  initial  and  terminal  states  of  a 
statechange  relation 


The  value  of  a  r I  ELD  SPECIFICATION  (FS )  applied  to  a 


conceptualization  is  computed  a'  follows: 

1)  Set  VALUE  to  the  entire  conceptualization. 

2)  In  the  current  VALUE,  find  the  field  referred  to  by  the  first 
element  of  the  FS  (CAR  FS)  Hake  the  new  VALUE  the  conceptual 
structure  filling  this  field. 

3)  Remove  the  first  element  from  the  FS  (FS*-2DR  FS) . 


4)  If  the  FS  is  enp  y  (NULL  FS)  return  the  current  VALUE; 
otherwise,  go  to  step  2. 

If  at  any  point  a. field  sought  in  step  2  is  not  present.  NIL  is 
returned  as  the  VALUE. 

The  value  of  the  FIELD  SPECIFICATION  (H03JECT  CON  ACTOR)  applied  to 

h  R  j - >  *CP*  « - *FRE0* 


♦JOHN*  <===>  *nTRANG*  • - 


<  *CP*  *-  —  *J0HN* 


♦JOHN*  <===>  *DQ* 

/  \ 

- >j|j  j. 

i - j 

*8 ILL*  |  - 

\ - - ! 


->  *HEALTH*  - - 10 


is  the  PP  *jOHN*. 


Spec ;  a  I  rec|uirements  are  used  main'y  for  the  introduction  of 
prepositions  into  the  syntax  nets  and  will  be  described  in  an 
example. 

in  processing  a  conceptual  structure,  3  'base  node’  for  the 
syntax  net  is  created.  This  is  termed  the  ACTIVE  NODE .  Uhen  a 
concexicon  entry  for  the  structure  has  been  found  (by  applying 
discrimination  nets  to  this  CURRENT  CONCEPTUAL  STRICTURE  (CCS)  ),  the 
relation  LEX  is  attached  to  the  ACTIVE  NODE  with  its  value  being  the 
reference  in  the  LEXICAL  POINTER  field  of  the  concexicon  entry. 

Next,  all  SFcCiAL  ACTIONS  specified  in  the  concexicon  entry  are 
taken,  and  finally  the  FRAMEWORK  is  processed  3s  follows: 

(1)  Get  the  next  FRAME.  If  no  more  exist,  go  to  step  3.  Otherwise, 
add  a  node  N  to  the  network  connected  to  iue  active  node  by  the 
SYNTAX  RELATION  specified  in  the  frame.  Hake  N  the  ACTIVE  NODE, 
saving  the  old  ACTiVC  NODE,  the  CCS,  and  the  unprocessed  FRAMES 
of  the  FRAMEWORK  on  the  NOOE  PUSHOuL'N  LIST  (N°Li. 

(21  Get  a  new  CCS  by  applying  the  FIELD  SPECIFICATION  of  the  FRAME 
to  the  old  CCS.  A;  y  the  network  generation  algorithm  to  the 
new  CCS.  thereby  expanding  the  syntax  net  from  the  ACTIVE  NOOE. 
(3)  If  the  NPL  is  empty,  generation  oi  the  syntax  net  is  completed. 
Otherwise,  'pop'  the  NPL,  restoring  an  existing  node  to  ACTIVE 
NODE  status,  and  restoring  a  CCS  and  unprocessea  3et  of  FRAMEs. 
Return  to  step  1. 

The  structure  specified  by  a  FIELD  SPECIFICATION  (in  step  2) 
may  turn  out  to  be  a  conceptual  nominal.  Since  these  are  represented 


only  as  pointers  into  memory  (*f1ARY*,  *£  K*)  instead  of  by  complex 
conceptual  structures,  they  are  treated  specially  by  the  generation 
algorithm.  Specifically,  nc  discrimination  net  is  applied,  but  a 
lexical  pointer  is  found  associated  with  the  concept  in  memory  (i.e., 
the  predicate  ENGLiSH-NANE  mentioned  earlier  is  used).  Treatment  is 
entiieiy  equivalent  to  considering  the  PP  as  itself  being  a 
concexicon  entry  uith  a  regular  LEXICAL  POINTER,  but  with  no 
FRAMEWORK'  or  SPECIAL  ACTIONS. 

Consider  hou  this  process  builds  a  net  from  our  simple  example 
(C4)  above.  The  conceptualization  would  be  taken  and  recognized  as 
an  event  structure  (main  link  <«>  )  with  ACT  »  *ATPANS*.  The 
discrimination  nets  for  such  structures  would  be  applied  and  a 
concexicon  pointer,  say  ‘RECEIVE!.’,  found.  The  concexicon  entry 
RECEIVE!  consists  of: 

LEXICAL  SPECIAL 

POINTER  FRANEUORK  ACTIONS 


II  1 

RECEIVE  ||  ACTSBJ  | 
il  1 

(  TO  ) 

1 

i  NIL 

I  . 

II 

II 

.IL 

NIL 

1  1 

i  OBJ  | 

1  1 

(  OBJECT  ) 

1 

|  NIL 

1 . 

1 

1 

1  1 

|  I  OBJ  | 
i  1 

(  ACTOR  i 

1 

|  (PREP  FROm 

1 

1 

1 

An  active  node  Gi  would  be  established  with  relation  LEX  and 
value  RECElE  (the  LEXICAL  POINTER  field  of  the  entry) 

Gi:  LE«  RECEIVE 

Processing  the  first  two  frames  attaches  an  ACTSBJ  NARY  (the 


to 


lexical  pointer  associated  with  *MARY*,  found  in  the  (  ID  )  field), 
and  an  OBJ  BOOK'  I  the  lexical  pointer  associated  with  *800K*,  found  in 
the  (  OBJECT  )  field). 


Gl: 

LEX 

RECEIVE 

ACTSBJ 

C2 

OBJ 

G3 

LEX 

MARY 

G3: 

LEX 

BOOK 

The  third  frame  specifies  the  relation  IOBJ  to  be  found  in  the 
AC  I  OR  field  (  *J0HN*  ).  The  SPECIAL  REQUIREMENT  (PREP  FROM)  in  this 
FRAME  has  a  minor  transformational  effect  on  *he  standard  processing 
uhich  can  be  seen  in  the  network,  created. 


Gl: 

LEX 

ACTSBJ 

OBJ 

RECEIVE 

C>2 

G3 

G4: 

PREP 

POBJ 

FROM 

GS 

IOBJ 

■C4  ' 

• 

G5: 

LEX 

JOHN 

G2:  LEX  MARY 

G3:  LEX  BOOK 

There  exist  both  syntactic  and  semantic  elements  not  yet 
present  m  this  network.  Every  sentence  has  a  TENSE  (PAST,  PRECrnt, 
FUTuro,  PAS l  PERFect,  °tc.)t  a  FORM  (SIMple,  PROGressi ve) ,  a  M000 
( INOICat  i  ve,  INTERROGat ive,  SIBJUNCtive.  IMPerative),  and  a  VOICE 
(ACTive,  PASsive).  Only  a  simple  analysis  of  these  problems  has  been 
i ncorpcrated  into  the  current  program.  Whenever  the  LEX!  relation 
attached  to  an  ACTIVE  CODE  in  the  syntax  np‘,  has  a  ’verb’  as  its 
value,  (VERB  being  a  lexical  category),  T2\’,E,  FORM,  MOOD,  and  VOICE 
relations  must  he  attached  to  the  node.  This  information  is  derived 
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from  the  CCS  ot  the  time.  TENSE  is  chosen  from  the  set  (PRES,  PAST, 
FUTi  depending  on  uhether  the  TIDE  of  the  conceptua I i rat i on  is  *N0U*, 
before  *N‘01J*,  or  after  *N0U*.  The  TICE  of  a  conceptualization  is 
♦  alien  as  that  at  tar  tied  to  the  main  'ink  if  it  is  one  of  the  simple 
types,  or  the  TINE  of  the  ANTECEDENT  in  a  causa!  relation.  f-ORfl  is 
always  choser,  as  SINple;  extension  to  PROGressive  will  occur  when  the 
representat  ion  of  time  is  expanded  to  include  intervals  as  well  as 
points.  MOOD  is  chosen  as  INTERROG  if  a  (?)  NODE  modifies  the 
conceptualization.  SUBJUNCtive  is  chosen  for  certain  <aC  structures. 
Otherwise.  INOlCative  is  used.  VOICE  is  currently  always  chosen  as 
ACTive;  presumably  FOCUS  could  be  used  to  choose  PARsive  in  some 
i nstances. 

in  addition  to  these  relations  which  are  required  by  English 

syntax,  information  may  be  present  in  a  conceptual  structure  which 

: ;i  i  I  not  be  processed  by  the  conco icon  entry  retrieved  for  that 

structure.  Tne  modifying  relations  of  conceptual  dependency  (PART  as 

in  *NECK*  * - *MARv*  ,  REF  in  *E00<*  '» - INDEF)  each  have  a 

PART  REF 

language  specific  function  associated  u>th  them.  Uhen  these 

relations  are  noticed  on  a  cc>ncepfua!  structure,  the  corresponding 

functions  are  executed  to  modify  the  syntax  node  created  for  the  head 

of  the  structure. 

These  considerations  result  m  a  completed  syntax  net  for  our 
examp  I e ; 
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Gl: 

LEX 

RECEIVE 

G4: 

PREP 

FROM 

ACTSBJ 

G2 

POBJ 

GS 

GB  .1 

G3 

I  OBJ 

G4 

GS: 

lEX 

JOHN 

TENGE 

FAST 

FORH 

sin 

VOICE 

ACT 

nooo 

INC!  C 

G2: 

LEX 

NARY 

G3: 

LEX 

BOOK 

GET 

A 

From  this  net  the  sentence  "haru  received  a  book  from  John"  is 
generated. 

As  described  above,  the  syntax  net  created  will  always  have  a 
tree  structure,  in  actuality,  new  nodes  are  created  for  conceptual 
nominal  s  onlu  if  no  noere  already  exists  for  that  element.  Otherwise, 
another  connection  to  the  existing  nod*'  is  marie,  in  the  syntax  net 
for  "John  to'd  Mary  he  saw  Hr.  Smith"  only  one  node  for  "John"  will 
be  present,  standing  in  an  ACT5BJ  relation  to  two  different  nodes. 
The  only  use  of  thia  fact  by  the  surface  generator  is  in  the 
inclusion  of  pronouns,  and  is  thus  no*  of  great  significance. 

The  surface  generation  grammar  will  not  be  described  here.  It 
is  based  on  the  grammar  and  program  used  by  Simmons  and  described  in 
[103;  the  principal  techniques  used  do  not  differ  from  those 
described  there.  Our  grammar  generates  understandable,  but  not 
total  iy  correct.  English  sentences  from  most  of  the  syntax  nets 
created  by  the  program.  Certainly  there  is  still  a  great  deal  of 
work  to  be  done  in  this  area;  the  conceptual  approach  taken  by  HARGIE 
does  not  alleviate  most  of  the  problems  caused  by  natural  language 
syntax  in  generation. 
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MULTIPLE  REALIZATIONS 


The  process  described  in  this  last  section  demonstrates  how  a 
conceptualization  can  be  used  to  produce  a  syntax  net  which  in  turn 
can  be  used  to  produce  an  English  sentence.  Since  each  step  of  the 
process  was  deterministic,  some  additional  mechanism  is  needed  to 
produce  paraphrases,  or  multiple  realizations,  from  a  single 
conceptual izat irn. 

One  way  to  do  this  would  be  to  define  meaning-preserving 
tr  sformat  ions  on  the  syntax  nets  --  changing  VOICE  from  ACTive  to 
PASfive  would  yield  a  different  surface  strin.  But  such  syntactic 
paraphrasing  is  clearly  not  the  sou-ce  of  the  examples  given  earlier. 
Rather,  they  are  obtained  by  allowing  the  discrimination  net 
algorithm  to  finct  more  than  one  response. 

It  was  pointed  cut  earlier  that  there  may  be  more  than  one 
discrimination  net  applicable  to  a  given  stimulus.  Sometimes  more 
than  one  of  these  will  produce  a  response.  Since  the  nets  are 
organized  to  group  'related'  meanings  into  a  single  net,  however,  it 
often  i  5  the  case  that  more  than  one  appropriate  response  exists 
u  i  thin  a  single  net.  This  case  is  handled  by  the  addition  of  two 
simple  devices  to  the  discrimination  nets. 

First,  terminal  nodes  are  allowed  to  have  associated  with  them 
not  iust  .3  single  concexicen  entry,  rut  a  set  of  such  entries.  Thus 
a  ‘stimulus’  which  finds  ‘he  response  LIKE1'  may  find  ENJOY!  and 
PLEAStl  at  the  same  time.  Each  of  these  entries  may  be  used  cs  tha 
source  for  generating  3  distinct  net.  leading  to  distinct 
paraphrases. 

G3 


This  handles  cases  of  what  night  be  called  conceptual 
synonymy’.  Such  cases  do  not  explain  a  great  deal  of  paraphrase, 
houever,  and  become  rarei  as  conceptual  representat ions  are  refined. 
The  second  device  used  is  to  permit  any  terminal  of  a  discrimination 
net  to  hold,  in  addition  to  a  list  of  ccncexicon  pointers,  a  pointer 
back  to  some  node  in  the  netuork.  (This  will  be  represented  by  the 
presence  of  a  "  T  <integer>  "  at  the  terminal,  <integer>  being  the 
index  of  some  node  in  the  netuork.  Some  terminals  mau  contain  only 
such  pointers,  and  no  responses  at  all.)  In  addition  to  using  the 
responses  found  at  a  terminal,  it  is  possible  to  follow  the  pointer 
and  resume  the  net  application  process  from  the  specified  node.  More 
formally,  it  is  necessary  to  modify  the  discrimination  net  algorithm 
as  foil ous: 
add  step 

0.  set  RESPONSES  to  NIL. 
replace  step  b  in  th 

6.  add  the  responses  associated  with  CURRENT-NODE  to  RESPONSES.  If 
CURRENT -NODE  has  no  associated  pointer,  then  return  RESPONSES. 
Otherwise  set  CURRENT-NODE  to  the  node  indicated  by  the  pointer 
and  go  to  step  2. 

This  may  lead  to  the  discovery  of  ’conceptually’  distinct 
responses.  Intuitively,  this  oroce?s  corresponds  to  'ignoring'  some 
feature  of  the  stimulus  which  English  provides  a  special  way  of 
expressing  and  finding  a  more  genera!  way  of  expressing  the 
information.  The  reader  is  invited  to  apply  the  partial 
discrimination  net  of  Figure  3  to  the  stimulus 
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T1 


i  fl  R 

♦JOHN*  •■==*>  *NTRANS*  —  —  n 

f 


PART 

>  *CP*  - - *t1ARY* 

PART 

<  *CP*  < - *JOHN* 


T2 

I 

4  M 

*NARY*  <==*>  *N TRANS*  - *CONCEPTS* 

/  c\ 

Ml 

- >  ill  I - >  *JOY* 


♦MARY*  | 


t  t  |- . <  *JOY* 

INC |  T3 
+2 


t  t  t  t 

T 1  T2  T3  T1 


PART 

R  |  — >*CP*  < - *(1ARY* 

—  I 

I— <  *BOOK* 


t 

*NOU* 


following  the  modified  discrimination  net  algorithm,  and  verify  that 

the  concexicon  entries 

ADVISE1  SUGGESTl  TELL! 

are  all  found.  The  paraphrases 

"John  advised  Nary  to  read  the  book" 

"John  suggested  to  Nary  she  would  like  to  read  the  book" 
"John  told  Nary  she  would  like  to  read  the  book" 

are  generated  from  this  stimulus,  as  well  as  several  others  which 

result  from  the  conceptual  synonomy  of  UKE4,  ENJOYl  and  PLEASE1,  and 

the  paraphrase  of 

"Nary  would  like  to  read  the  book"  as 

“Nary  would  become  happier  if  she  read  the  book". 
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|  |  (FROM)  -  *CP* 

|  1  |  (ACTOR)  -  (FROM  PART) 

| _ |  (ACTOR)  -  (TO  PART) 


subtree  including  _ | _ 

REMF.MBER1,  READ1  |  | 

I  3  |  NEG  c  (MOBJECT  MODE) 


POT_HEAO(MOBJECT)  -  LIKE4 
(TIME)< (MOBJECT  TIME) 


I  G  | 


.1  I. 


subtree  including 
REFUSE 1 


(MOBJECT  CON  ACTOR) -(TO  PART)  | 


(TIME) < (MOBJECT  TIME) j  12  | 

1  1 

II 

(MOBJECT  <«  ACTOR)  -(TO  PART)  |  13  | 

1  1 

II 

1 

1 

1 

1 

1 

1 

subtree  including 

1 

1 

FOCUS  -  1  | 

1 

|  (ACTOR)-  REQUEST 1 

1 

(TO  PART) |24  ! 

I  25 
| 

|  (MOBJECT  ACTOR) 

| 

1  27 
j 

|  j 

IT 

1 

1  1 

1 

ADVISE, 

i  i 

1 

1 

tl2 

II  I  I  I  I  II 

|  48  |  |  49  |  |  50  |  |  51  | 

II  I  !  I  I  II 


TELL1 

HEAR2 

SUGGEST 1 

PROMISE! 

T4S 

T24 

t24 

FIGURE  3 

partial  discrimination  net  for  *MTRANS*  EVENTS 


rIELD  SPECIFICATION'S  in  predicates  refer  to  fields  of  an  *MTRANS*  conceptual 
stimulus.  Multiple  predicates  at  a  node  -form  a  conjunction. 
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V. 


CONCLUSION 


In  part  I  of  this  report  ue  described  a  natural  language 
processing  task  uhich  we  termed  ‘sentence  paraphrasing*.  It  uas 
claimed  that  this  t’sk  is  of  interest  for  three  major  reasons! 

1)  its  relation  to  machine  translation 

2)  the  need  to  ‘understand’  natural  language  in  order  to  produce 
paraphrases 

3)  the  effect  of  context  on  sentence  paraphrases 

MARGIE  is  a  computer  program  uhich,  given  an  English  sentence 
as  input,  can  produce  English  sentences  uhich  are  paraphrases  of  that 
input,  in  order  to  describe  its  operation  it  uas  necessary  to  define 
the  notion  of  conceptual  representation.  Section  II  enumerated  the 
properties  intrinsic  in  such  representat ions  and  gave  more  detailed 
examples  of  the  particular  representat ion  employed  by  MARGIE. 

The  production  of  paraphrases  requires  tuo  basic  processes. 

The  first  takes  English  sentences  and  produces  conceptual 
representations  of  their  meaning:  the  second  performs-  the  inverse 
operation. 

Part  III  discussed  a  conceptual  analyzer  for  English  sentences. 
The  analyzer’s  goal  uas  to  find,  for  a  given  English  sentence,  the 
conceptualization  that  represented  the  meaning  a  human  would  assign 
to  that  sentence  in  the  same  context.  The  basic  mechanism  uas  the 
request.  Uords  had  both  features  and  requests  associated  uith  them. 
Encountering  a  word  made  both  sets  available  to  the  analyzer.  A 
feature  uas  a  conceptualization.  A  request  uas  a  predicate  plus  a 
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set  of  actions.  If  the  predicate  became  true  while  the  request  uas 
still  active,  the  actions  were  performed.  The  predicates  could  ask 
questions  about  the  words  and  concepts  found  cr  about  the 
conceptualization  being  built.  The  actions  could  modify  tnat 
conceptualization  or  alter  the  set  of  requests  active.  In  general, 
verbs  and  prepositions  contributed  most  of  the  requests  while  nouns 
were  important  mainly  for  the  features  associated  with  them. 

Part  IV  detailed  the  operation  of  a  program  to  express  a 
conceptually  represented  meaning  in  English.  This  was  accomplished 
via  an  intermediate  ‘syntax  net’.  To  produce  the  net  it  was 
necessary  to  discriminate  conceptual  patterns  for  which  English 
provides  particular  verbs.  To  choose  verbs  not  only  pattern 
matching,  but  conceptual  knowledge  and  even  theorem  proving 
capabilities  were  seen  to  be  necessary.  The  discriminations  provided 
a  link  between  pure  conceptual  structures  and  units  in  a 
'concex  i  con' .  These  units  in  turn  provided  a  link  from  the 
conceptual  to  the  syntactic,  and  enabled  the  construction  of  the 
syntax  nets. 

The  paraphrase  task  did  not  motivate  either  the  analysis  or 
generation  algorithm.  The  analyzer  performs  no  operations  specific 
to  this  task.  And  it  was  shown  that  only  a  slight  modification  to  a 
general  conceptual  to  English  generator  was  needed  to  accomplish 
paraphrasing.  The  property  of  explicit  representation  of 
similarities  in  meaning  at  the  conceptual  level  makes  this  possible. 
It  also  means  that  no  manipulations  of  the  meaning  representation  are 
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needed  for  paraphrasing.  This  stands  in  contrast  to  representations 
based  on  words  or  word  senses,  which  would  require  the  application  of 
transformational  or  inference  rules  to  produce  paraphrases. 


v 
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VI. 


EXAMPLES  OF  PROGRAM  OUTPUT 


Lines  preceded  by  a  '*’  were  typed  by  the  user.  Lower  case  is  used 
for  explanatory  remarks.  All  other  lines  uere  typed  by  the  program. 

TYPE  INPUT 

*(J0HN  GAVE  MARY  A  BICYCLE)  the  input  sentence. 

OUTPUT  FROM  PARSER:  in  two  parts;  first  time  relationships; 

TIM00  :  ( (VAL  *!*))  TIM00  has  value  *T*  (analog  of  *N0U*  ) 

TIM01  :  ((BEFORE  TIM00  X))  TIM01  is  before  T1M00  by  an  unspecified  amount  X 

then  the  actual  conceptualization,  utilizing  these  times: 

((ACTOR  (JOHN)  <«>  (*A TRANS*)  OBJECT  (BICYCLE  REF  (*A*))  FROM  (JOHN)  ~ 

TO  (MARY))  TIME  (TIM01)  FOCUS  ((ACTOR)).' 


PARTIALLY  INTEGRATEO  RESULT:  this  is  the  memory  model’s  internal  format 
( (*ATRANS*  ( # JOHN 1 )  (G0004)  (ffJOHNl)  (0MARY1))  (TIME  _  (G0006) ) ) 

THINGS  TO  SAY:  this  is  the  conceptualization  after  being  internalized 
and  reconstructed  by  the  memory  model, 

((ACTOR  (JOHN)  <*>  (*A TRANS*)  OBJECT  (BIKE  REF  (*A*))  FROM  (JOHN)  TO  ~ 
(MARY))  TIME  (G00CG)  FOCUS  ((ACTOR))) 


the  syntax  net  produced  by  the  generator: 


jbj: 

(G0023) 

G0021: 

LEX 

(JOHN) 

OBJ 

(G0022) 

ACTSBJ 

(G0021) 

VOICE 

(ACT) 

G0022: 

GET 

(A) 

FORM 

(SIM) 

LEX 

(BICYCLE) 

TENSE 

(PAST) 

MOOO 

( 1  NO  I C ) 

LEX 

(GIVE) 

G8023: 

LEX 

(MARY) 

the  string  produced  by  the  surface  grammar: 

(JOHN  GAVE  MARY  A  BICYCLE)  the  ‘identity’  paraphrase  in  this  case 


more  nets  and  surface  strings: 
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G0B22: 

I  OBJ 

(G0025) 

G6024: 

DET 

(A) 

OBJ 

(00024) 

LEX 

(BICYCLE) 

ACTSBJ 

(G0023) 

VOICE 

(ACT) 

FORil 

(Slill 

G0025: 

POBJ 

(G002G) 

TENSE 

(PAST) 

PREP 

(TO! 

hood 

f INPiC) 

LEX 

(GIVE) 

G002G: 

LEX 

(MARY) 

G0023: 

LEX 

(JOHN) 

(JOHN  GAVE  A  BICYCLE  TO  MARY) 

by  having  concexicon  entries  GIVE!  and  G! V'E2.  both  with 

lexical 

pointer 

GIVE,  and  both  occur ing 

at  the 

same  terminal  of 

a 

di scriminat ion 

net,  but  with  c-| 

ightlu  d 

afferent  syntax  FRAi'iEs, 

the 

forms 

X  g_*. ,  c  v 

Z"  and  "X  gave 

Z  to  Y“ 

are  generated. 

•J0023: 

I  OBJ 

(G0026) 

G032S: 

uEX 

CHARY) 

ACTS8J 

(G0025) 

OBJ 

(C0024) 

VOICE 

(ACT) 

G002S: 

POBJ 

,G0027) 

FORM 

(S!H) 

PREP 

(FROtti 

TENSE 

(PAST) 

MOOD 

(INDIC) 

LEX 

iGtT) 

G0827: 

LEX 

(JOHN) 

G0024; 

DET 

(A) 

LEX 

(BICYCLE) 

(MARY  GOT  A  Bi CYCLE  FROM  JOHN! 

a  pointer  from  the  terming  holding  GIVE  was  followed,  having  the 
e'fect  of  ignoring  the  focus  on  the  ACTOR  UJGHN) 


G0024: 


I  OBJ 

(CO027) 

G002S: 

LEX 

(HARY) 

ACTSBJ 

(G0026) 

OBJ 

(GO025) 

VOICE 

(ACT) 

G0027: 

POBJ 

(G0028) 

FORH 

(Sim 

PREP 

(FROm 

TENSE 

(PAST) 

nooo 

(INOiC) 

LEX 

(RECEIVE) 

G0028: 

LEX 

(JOHN) 

OET 

(A) 

LEX 

(BICYCLE) 

(HARY  RECEIVED  A  BICYCLE  FROM  JOHN! 
also  ignores  the  focus  on  JOHN 


In  the  remaining  examples  only  the  input  sentence,  parser  output,  and 
paraphrase  surface  strings  will  be  shown.  The  reader  should  Keep  in 
mind  that  the  thrust  of  our  work  in  generation  has  been  toward  the 
production  of  syntax  nets  from  conceptual  structures.  No  morp  effort 
has  been  put  into  the  surface  phase  of  generation  than  was  necessary 
to  obtain  minimally  readable  realizations.  While  syntactic  problems 
in  the  resulting  surface  strings  occasionally  reflect  actual 
inadequacies  in  the  production  of  the  syntax  nets,  the  great  majority 
of  such  faults  result  from  the  primitive  transition  network  grammar 
being  employed  to  map  the  syntax  nets  into  sentences. 

TYnE  INPUT 

* ( JOHN  ADV.5ED  MARY  TO  DRINK  THE  UINE) 


OUTPUT  FROn  PARSER: 

Tlh'00  :  ( ( VAL  *T* 

TIf101  :  ((BEFORE  T1M30  X)) 
TIM02  :  ((AFTER  11031  X)) 
Tin??  :  ;  (AFTER  T 1M81  XM 


It  might  be  noteo  mat  the  times  given 
may  not  appear  in  the  output.  Some  are 
generated  at  one  point  in  the  program 
and  then  overwritten  bu  later  act. oris. 

No  information  has  been  lost  and  usually 
the  overwritten  ,ime  was  a  reference 
point.  TiH00  and  TH102  are  replaced  here. 


((ACTOR  (JOHN!)  <*>  (*i1TRANS*!  TO  (*CP*  PART  (FlARYl)  REF  (*THE*) )  FRO- 
fi  (*CP*  PART  ( J0HH1 )  REF  (*THE*))  MOBJECT  ((CON  ((ACTOR  (FlARYl )  <«>  (~ 
♦  INGEST*)  OBJECT  CUINE1  REF  (*THE*>)  TO  ulNSlDEs  PART  (NARYl))  FROH  - 
(*i10UTH*  PART  (HARvl )  >  INST  UQNE*) )  NODE  (NIL)  TIME  TTltt03)  FOCUS  ( (- 
AC  TOR  I ) )  <sC  ((ACTOR  iTiARVi )  >T  C*J0V*)  <S>F  (♦JOY*) )  INC  (2)  TIF1E  - 

nines)  node  wumi  focus  ( (actor) )  fiooe  (nil)  tine  (tifiod ) 
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paraphrases 

(JOHN  AO  VISED  NARY  TO  DRINK  SOME  U I NE » 


even  though  the  input  contained  ‘THE  UINE’,  the  memory  was  unable  to 
determine  what  ‘THE  WINE*  referred  to.  (No  nine  existed  in  the 
current  context.)  In  passing  the  conceptualization  on  to  the 
generator,  the  'definite*  reference  on  U!NE  was  changed  to 
‘indefinite’.  The  generator  expresses  the  indefinite  reference  as 
‘A*  or  ‘SONE’,  depending  on  conceptual  properties  of  the  governor. 
Since  the  concept  referred  to  by  NINE  in  the  conceptualization  is  a 
physical  substance,  but  not  an  ‘entity*,  the  modifier  SONE  is  chosen. 

(JOHN  ADVISED  MARY  TO  INGEST  S0T1E  NINE) 

(JOHN  SUGGESTED  TfJ  MARY  SHE  UOULO  LIKE  TO  DRINK  SONE  UiNE) 

(JOHN  SUGGESTED  TO  MARY  SHE  UOULO  LIKE  TO  INGEST  SOME  UINE) 

(JOHN  SUGGESTEO  TO  NARY  SHE  ORINKS  SONE  UINE  WOULD  PLEASE  HER) 

(JOHN  SUGGESTEO  TU  NARY  SHE  INGESTS  SONE  UINE  WOULD  PLEASE  HER) 

(JOHN  SUGGESTEO  TO  NARY  SHE  UOULO  ENJOY  SHE  ORINKS  SONE  UINE) 

(JOHN  SUGGESTEO  TC  NARY  SHE  UOULO  ENJOY  SHE  INGESTS  SONE  UINE) 

(JOHN  SUGGESTEO  TO  NARY  SHE  BECONES  HAPPY  IF  SHE  DRINKS  SONE  UINE) 

(JOHN  SUGGESTEO  TO  NARY  SHE  BECONES  HAPPY  IF  SHE  INGESTS  SONS  UINE) 
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TYPE  INPUT 

*<MARY  UANTr  TO  CHOKE  JOHN) 

OUTPUT  FROM  PARSER: 

TIM00  :  ( (VAL  *T*)) 

TIM01  t  ((AFTER  T IM00  X) ) 

TIM02  :  ( (AFTER  T!f100  X)) 

T I f i03  (AFTER  TIH00  X)) 

((CON  ((COM  ((COM  ((ACTOR  (flARvl )  <  =  >  l*GRASP*)  OBJECT  (*NECK*  PART  (~ 
JOHN1 ) ) )  TIME  (T IM03) >  <*  ((ACTOR  (JOHN1)  <=>  UINGEST*)  OBJECT  (*AIR~ 
*  REF  (♦A*))  FROM  (*f10UTH*  PART  (JOHNII)  TO  UINSIOE*  PART  (JOHN1)))  - 
TIME  (TIM03)  MODE  ( <*CANNQT*) ) ) )  FOCUS  (CON  ACTOR))  <sC  ( (ACTOR  (MARY- 
))  <5>T  (♦JOY*)  <s>r  (*JOY*))  INC  (2)  TIME  (TIM02) ) ) )  "«>  (*MLOC*  VAL- 
(*LTM*  PAr,T  (MARY  1 )  REF  (*THE*) ) ) )  MODE  (NIL)  FOCUS  ((•■»>  VAL  PART))- 
TIME  (TIM00) ) 


paraphrases 

(MARY  UANTS  TO  CHOKE  JOHN) 

The  original  input  is  again  the  first  realization  found.  It  is  natural 
to  organise  the  rii  scr  iminat  ion  nets  so  that  the  first  ‘response’  found 
is  the  one  which  expresses  a  ‘maximal'  conceptual  substructure.  This 
normally  results  in  the  most  concise  linguistic  expression. 

(MARY  UANTS  TO  PREVENT  JOHN  BREATHES  BY  SHE  GRABS  HIS  NECK) 

(MARY  UANTS  TO  PREVENT  JOHN  INHALES  SOME  AIR  BY  SHE  GRABS  HIS  NECK) 

(MARY  UANTS  TO  CAUSE  JOHN  IS  UNABLE  TO  BREATHE  BY  SHE  GRABS  HIS  NECK) 

(MARY  UANTS  TO  CAUSE  JOHN  IS  UNABLE  TC  INHALE  SOME  AIR  BY  SHE  GRABS 

HIS  NECK) 

(MARV  UANTS  TO  CAUSE  JOHN  NOT  CAN  BREATHE  BY  SHE  GRABS  HIS  NECK) 

(MARY  UANTS  TO  CAUSE  JOHN  NOT  CAN  INHALE  SOME  AIR  BY  SHE  GRABS  HIS 

NECK) 


(MARY  UANTS  JOHN  IS  UNABLE  TO  BREATHE  BECAUSE  SHE  GRABS  HIS  NECK) 

(MARY  UANTS  JOHN  IS  UNABLE  TO  INHALE  SOME  A t R  BECAUSE  SHE  CRABS  HIS 
NECK) 

(MARY  UANTS  JOHN  NOT  CAN  EPEATHE  BECAUSE  SHE  GRABS  HIS  NECK) 

(MARY  HOPES  SHE  CrOKES  JOHN) 
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all  the  paraphrases  of  "choke"  are  produced  again,  this  time  combined 
uith  "hope"  instead  of  "iiant". 

(MANY  THINKS  SHE  WOULD  LIKE  TO  CHOKE  JOHN) 

all  the  paraphrases  of  "like"  (seen  in  the  preceding  example) 
are  a  i  so  produced  in  the  embedded  sentence. 


TYPE  INPUT 

♦(JOHN  KILLED  MARY  BY  CHOKING  MARY) 


The  analysis  of  this  example  plus  a  graphic  equivalent  of  the  outbut 
can  be  found  in  section  ill. 

OUTPUT  FROM  PARSER: 

TIM00  :  (IVAL  *T*)) 

TIM01  :  ((BEFORE  TIM00  X)) 

TIM02  :  ((BEFORE  TIM61  X)) 

♦ 

((CON  ((CON  ((ACTOR  (J0HN1)  UGRASP*  OBJECT  UNECK*  PART  (MARY1 )~ 
))  TIME  (TIM02))  <  =  ( (ACTOR  (MARYl )  <=>  (♦INGEST*)  OBJECT  (*A]R*  REF  ~ 
(*A*) )  FROM  UMOUTH*  PART  (MARYl))  TO  (*!NSI0E*  PART  (MARYl)))  TIME  (~ 
TIM02)  MODE  i (*CANN0T*))))  FOCUS  (CON  ACTOR))  a  UCUN  ((ACTOR  ‘MARYl )~ 
<=>  (*INGEST*)  OBJECT  (*AIR*  REC  (*A*))  FROM  (*M0UTH*  PART  (MARYl))  - 
TO  UiNSIOE*  PART  (MARYl!))  TIME  (TIM02!  MODE  ( UCANNOT*) ) )  <«  ( (ACTO-v 
R  (MARYl)  <  =  >T  (*h'EAL TH*  VAL  (-10))  <s>F  (*HEALTH*  VAL  (NIL)))  MOOE  (~ 
NIL)  TIME  ( T IM01 )))))) 


paraphrases 
(JOHN  STRANGLED  MARY) 

Here  the  first  paraphrase  csces  not  match  the  input.  It  is.  in  fact, 
more  concise:  the  generator  see?  the  representation  produced  by  the 
analyrer  for  "killed  by  choking”  as  sufficient  for  use  of  the  word 
" s  t  r any ! e " . 

(JOHN  CHOKED  MARY  AND  SHE  DIED  BECAUSE  SHE  UAS  UNABLE  TO  BREATHE) 

(JOHN  CHOKED  MARY  AND  SHE  DIED  BECAUSE  SHE  UAS  UNABLE  TO  INHALE  SOME 
AIR) 

(JOHN  CHOKED  MARY  AND  SHE  DIED  BECAUSE  SHE  NOT  COULD  BREATHE) 
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(JOHN  CHOKED  MARY 

(JOHN  CHOKED  MARY 

(JOHN  CHOKED  MARY 
SOME  AIR) 

(JOHN  CHOKED  (1ARY 


AND  SHE  DIED  BECAUSE  SHE  NOT  COULD  INHALE  SOME  AIR) 

AND  SHE  BECAME  DEAD  BECAUSE  SHE  UAS  UNABLE  TO  BREATHE) 

AND  SHE  BECAME  DEAD  BECAUSE  SHE  UAS  UNABLE  TO  INHALE 

AND  SHE  BECAME  DEAD  BECAUSE  SHE  NOT  COULD  BREATHE) 


TYPE  INPUT 

* ( JOHN  TOLD  MARY  THAT  JOHN  UOULD  HIT  MARY) 

OUTPUT  FROM  PARSER: 

TIM00  :  ((VAL  *T*! ) 

T1M01  :  ((BEFORE  TIM00  X)) 

TIM02  :  ((AFTER  TIM09  X)) 

((ACTOR  ( JOHN  1 )  <=>  (*M TRANS*)  TO  (*CP*  PART  (MARY1 )  REF  (*THE*) )  FRO- 
(1  (*CP*  PART  ( JOHN 1 )  REF  (*THE*) )  M06JECT  ((CON  ((ACTOR  (JOHN1)  <«>  (- 
♦PROPEL*)  OBJECT  UHANU*  PART  (JOHN1 ) )  TO  (MARY1 )  FROM  (JOHN1)  INST  (- 
(ACTOR  (JOHN!)  <*.  (*MOVE*)  OBJECT  (*H AND*  PART  (JOHN1)))))  TIME  (TIM- 
02)  MODE  (NIL) )  <=  ((ACTOR  (*HAN0*  PART  (JOHN!))  <*>  (*PHYSCONT*  VAL  - 
(MARYl ) )  )  TIME  (T ! H02)  MODE  (NIL)  FOCl~  (CON  ACTOR* ))) )  TIME  (TIM01 ) ) 


'  paraphrases 

The  preceding  examples  wei  e  run  in  a  mode  in  which  queries  from  the 
generator  to  the  memory  which  the  memory  was  not  yet  capable  of 
handling  resulted  In  a  uniform  response  o(  FALSE.  This  example 
produces  more  interesting  results  when  run  in  a  mode  which  allows 
these  queries  to  be  answered  by  human  intervention  at  the  teletype: 


TIME  TO  PLAY  COD  —  IS  THjS  TRUE? 

(CON  ((COD  ((ACTOR  (JOHN)  <->  UPROPEL*)  OBJECT  UHAND*  REP  (*A*)  PAR*. 
T  (JOHN))  FROM  (JOHN)  TO  (HARY)  INST  ((ACTOR  (JOHN)  <->  (»nOVE»)  OBJE¬ 
CT  UHAND*  REF  UA*)  PART  (JOHN))  FRON  UONE*)  TO  UONE*) )  FOCUS  ((AC¬ 
TOR))))  TIME  (GCPOSi  FOCUS  ((ACTOR)))  <a  ((ACTOR  UHAND*  REF  UA*)  PA¬ 
RT  (JOHN))  <*>  UPHVSCONT*  VAL  (NARY)))  TIDE  (G0003))))  <sC  ((ACTOR  (- 
MARY)  <=>T  UPSTATE*!  <a>F  UPSTATE*) )  INC  (*?*  LEQUAL  (-3))  TINE  (*’* 
AFTER  (G00OS) ) ) ) 

*  1 


Thp  program  asks  (conceptually)  whether  John's  hitting  Nary  (at  time 
G000S)  could  cause  3  change  in  Nary's  position  on  the  ‘physical 
state'  scale  UPSTATE*)  at  some  time  after  G0888)  by  an  increment  < 
-3,  The  human  respondent  answered  True. 

(JOHN  THREATENED  TO  HIT  NARY) 


The  knowledge  of  the  potentially  injurious  nature  of  the  event 
communicated  by  John  allowed  the  program  to  choose  "threaten". 

'JOHN  THREATENED  TO  HIT  NARY  UI TH  HIS  HAND) 

The  'instrument'  of  the  hitting  is  normally  expressed.  When  it  is 
the  hand  of  the  ‘hitter’,  however,  it  can  be  left  off,  as  in  the 
preceding  rea I i zat i on.  ‘Hand’  was  net  present  in  the  input;  the 
analyzer  made  the  assumption  th3t  John’s  hand  was  what  he  would  hit 
Nary  with.  The  generator  assumes  other  people  use  this  default  too. 
and  thus  permits  both  realizations. 

(JOHN  TOLD  NARY  HE  HILL  HIT  HER) 


(JOHN  TOLD  NARY  HE  WILL  HI  -  HER  WITH  HIS  HAND) 


TYPE  'NPUT 

*  I  JOHN  LOANED  A  BICYCLE  TO  NARY) 


OUTPUT  FROM  PARSER: 

T  i  N00  :  ( (VAL  *T*H 
TIM01  :  ((BEFORE  1INP0  X)) 

TINP2  :  ((AFTER  11001  X!) 

((CON  ((ACTOR  (JOHN l )  <=?  UATRANS*)  TO  (NARY1 )  FROM  ( JOHN  1 )  OBJECT  U 
BIK'El  REF  (*A*I!)  TINE  (TlflOl) )  a  ( (CON  ((ACTOR  (NARYl )  <->  UATRANS*- 
I  TO  (J0HN1)  FRON  (NARYl)  OBJECT  (BIKE1  REF  (*A*>))  TINE  (T1N02))  <s>- 
UMLOC*  VAL  ULTN*  PART  ( J0HN1 ) ) ) )  FOCUS  ( (<=>  VAL  PART))  TINE  (TIN0~ 
l)))l 

The  conceptual  representation  found  is  a  conduction: 

1)  John  gave  Nary  the  pi  cycle,  and 

2)  John  believed  at  that  time  that  Nary  would  give  the  bicycle  to  him 
at  some  future  time. 


paraphrases 

TINE  10  PLAY  GOO  --  IS  THIS  TRUE7 

((ACTOR  (BIKE  REF  (*A*I  I  <h>  UPOSS*  VAL  (NARY) )  i  TINE  (*?*  BEFORE  (Go- 
0003  I ) ) 

*NIL 

the  generator  asked  whether  Nary  had  the  Bicycle  at  any  time  before 
John  loaned  it  to  her.  The  answer  given  was  "no-'L 

* 

TIME  TO  PLAY  GOO  --  IS  THIS  TRUE? 

((ACTOR  (BIKE  REP  UA*i)  <»>  UPOSS*  VAL  (JOHN)))  TINE  (*?*  BEFORE  (G- 
00^2) ) ) 

*T 

the  generatcr  as*.s  whether  John  had  the  bicycle  at  any  time  before 
the  time  at  which  he  believes  she  will  be  giving  it  to  him.  The  answer 
qiven  was  "yes". 

(JOHN  GAVE  A  BICvCLE  TO  NARr  AND  HE  EXPECTED  SHE  TO  RETURN  IT  TO  HIN) 

(JOHN  GAVE  A  BICYCLE  TO  NARY  AND  HE  EXPECTED  SHE  TO  GIVE  HIN  IT) 

(JOHN  GAVE  A  BIC-CLE  TO  NARY  AND  HE  EXPECTED  SHE  TO  GIVE  IT  TO  HIN) 

(JOHN  GAVE  A  BICYCLE  TO  NARy  AND  HE  EXPECTED  TO  GET  IT  FRON  HER) 

(JOHN  GAVE  A  BICYCLE  TO  NARY  AND  HE  EXPECTED  TO  RECEIVE  IT  PRON  HER) 


BA 


M 


iftsia 


jmigi 


1 —  = 


(JOHN  GAVE  A  BICYCLE  TO  MARY  AND  HE  THOUGHT  SHE  (JILL  RETURN  IT  TO  HIM) 


TYPE  INPUT 

* (JOHN  SOLO  A  BICYCLE  10  MARY) 

OUTPUT  FROM  PARSER: 

TIM00  :  ( (VAL  *T*)! 

TIM01  :  ( (BEFORE  TIM00  X) ) 

((CON  ((ACTOR  (J0HN1)  <=>  (*ATRaNS*)  OBJECT  (BIKE1  REF  (*A*))  TO  (MAR- 
Yl)  FROM  ( J0HN1 1 )  TIME  (TIM01 ) >  <==>  ((ACTOR  (MARY1)  <->  (*ATRANS#)  0- 
BJECT  (M0NEY1)  TO  (JOHN!)  FROM  (MARY1) )  FOCUS  ((CON  ACTOR))  TIME  (TIM- 
01)))) 


paraphrases 

(JOHN  SOIO  MARY  A  BIC.'CLE  FOR  MONEY) 

(MARY  BOUGHT  A  BICYCLE  FROM  JOHN  FOR  MONEY) 

(MARY  PAVED  JOHN  MONEY  FOR  A  BICYCLE) 

(JOHN  TRADED  MARY  A  BICYCLE  FOR  MONEY) 

(MARY  TRADED  JOHN  MONEY  FOR  A  BICYCLE) 

TYPE  INPUT 

* ( JOHN  AGGRAVATED  FRED) 

OUTPUT  FROM  PARSER: 

TIM00  :  ((VAL  *7*1) 

TIM01  :  ((BEFORE  T i MOO  XI) 

((CON  ((ACTOR  ( JOH'il  i  <  =  .>  UDO*)))  ((ACTOR  (FRE01)  <*>T  it  ANGER*)  - 
s>F  (*ANGER*)I  TIME  (TIM01I  INC  (2i >  FOCUS  ( (CON  ACTOR) )) ) 

paraphrases 
(JOHN  ANNOYED  FRED) 

(JOHN  AGGRAVATED  FRED) 


SS 


(JOHN  MADE  FRED  BECAME  ANGRY) 

(JOHN  CAUSED  FRED  BECAME  ANGRY  BY  HE  DID  SOMETHING) 
(FRED  BECAME  ANGRr  BECAUSE  JOHN  DID  SOMETHING) 
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